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HUMANIZED ANTIBODIES AGAINST 
VASCULAR ENDOTHELIAL GROWTH FACTOR 

CROSS REFERENCE TO RELATED APPLICATIONS 

5 

This application is a continuation-in-part of U.S. Application No. 10/443,134, 
filed May 20, 2003, entitled "Generation and Selection of Protein Library in Silico" 
which is a continuation-in-part of U.S. Application No. 10/153;159, filed May 20, 2002, 
entitled "Structure-Based Selection And Affinity Maturation of Antibody Library, and 

10 also a continuation-in-part of Application No. 10/153,176, filed May 20, 2002, entitled 
"Generation Affinity Maturation of Antibody Library in Silico", both of which are a 
continuation-in-part of U.S. Patent Application Serial No: 10/125,687 entitled "Structure- 
based construction of human antibody library" filed April 17, 2002, which claims the 
benefit of U.S. Provisional Application Serial No: 60/284,407 entitled "Structure-based 

15 construction of human antibody library" filed April 17, 2001. These applications are 
incorporated herein by reference. 

BACKGROUND OF THE INVENTION 

20 The present invention provides antibodies against human antigens and methods 

for generating high affinity antibodies against these targets. More particularly, the 
invention provides humanized or human antibodies against vascular endothelial growth 
factor (VEGF) and methods for generating such anti-VEGF antibodies. In addition, the 
invention provides compositions, kits and methods of using these antibodies and 

25 derivatives thereof to inhibit angiogenesis in vitro, and for diagnosing or treating diseases 
associated with abnormal angiogenesis such as cancer, rheumatoid arthritis, ischemic- 
reperfusion related brain edema and injury, cortical ischemia, ovarian hyperplasia and 
hypervascularity, endometriosis, psoriasis, diabetic retinopaphy, and other ocular 
angiogenic diseases. 

30 Angiogenesis has been involved in many physiological and pathological 

processes. Angiogenesis consists of multiple steps that ultimately resulting in 
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proliferation and differentiation of endothelial cells, and formation of tubes and cavities 
(angiogenesis). Factors that promote angiogenesis include VEGF, aFGF, bFGF, TGF- a 
, TGF- 0 , HGF, TNF- a 9 angiogenin, IL-8, etc., whereas factors that inhibit angiogenesis 
include thrombospondin (Good et al. Proc. Natl Acad. Sci. USA. 87:6624-6628 (1990)), 
5 the N-terminal fragment of prolactin (Clapp et al. Endocrinology, 133:1292-1299 

(1993)), kringle 5 domain of plasminogen (Cao et al, J. Biol. Chem., 271:29461-29467 
(1996), angiostatin (O'Reilly et al. Cell 79:315-328 (1994)) and endostatin (O'Reilly et 
al. Cell 88:277-285 (1996)). 

Vascular endothelial cell growth factor (VEGF) is a growth factor acting 

10 specifically with its receptors on vascular endothelial cells to promote their angiogenesis. 
It is known that angiogenesis plays an important role in the development of new 
vasculature from preexisting blood vessels and/or circulating endothelial stem cells 
(Asahara et al., Science, 275(5302):964-967, 1997; Springer et al., Mol. Cell, 2(5):549- 
558, 1998; Folkman and Shing, J. Biol. Chem., 267:10931-10934, 1992). Angiogenesis . 

15 also plays a vital role in many physiological processes, such as embryogenesis, wound 
healing and menstruation. More importantly, angiogenesis is further involved in 
pathological conditions such as tumor formation, metastasis, diabetic retinopathy, etc. It 
is known that the growth of a solid tumor requires tumor vascularization for supplying 
oxygen and nutrients and the metastasis of tumor cells occurs through blood vessels. 

20 resulting from the tumor vascularization. VEGF is believed to be a pivotal angiogenic 
factor in this vascularization for tumors. Therefore, it is expected that the growth and 
metastasis of tumor can be inhibited by certain substances neutralizing the 
vascularization activity of VEGF. Recent studies (Burrows and Thorpe, Pharmacol. 
Ther., 64:155-174, 1994; Proc. Natl. Acad. Sci. USA, 90:8996-9000, 1994) have used 

25 such a strategy to target the vasculature of solid tumors. Targeting the blood vessels of 
the tumors, rather than the tumor cells themselves, has certain advantages in that it is not 
likely to lead to the development of resistant tumor cells, and that the targeted cells are 
readily accessible. Moreover, destruction of the blood vessels leads to an amplification of 
the anti-tumor effect, as many tumor cells rely on a single vessel for their oxygen and 

30 nutrients 
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In nearly half of diabetics diabetic retinopathy occurs as one of complications of 
diabetes. It is believed that the formation of microcapillaries is promoted in diabetic 
retinopathy by oxygen deficiency. These microcapillaries will sooner or later be ruptured 
to bleed to form scar tissue, leading to detached retinas. Age-related macular 
5 degeneration is another eye disease that has been demonstrated to be involved in 
pathological vascularization in the retina. Hence, it is expected that inhibition of 
vascularization can prevent retinopathy from developing. Based on experiments using 
monkey, Miller et al. reported that VEGF is related very closely to the development of 
vegetative retinopathy (Miller et al.: Am. J. Pathol. 145, 574-584 (1994)). For this reason, 

10 a substance neutralizing the vascularization activity of VEGF is considered useful for 

preventing or treating diabetic retinopathy and AMD (Lopez et al. Invest. Ophtalmo. Vis. 
Sci. 37:855-868 (1996)). 

The recognition of VEGF as a primary stimulus of angiogenesis in pathological 
conditions has led to various attempts to block VEGF activity. Inhibitory anti-VEGF 

15 receptor antibodies, soluble receptor constructs, antisense strategies, RNA aptamers 
against VEGF and low molecular weight VEGF receptor tyrosine kinase (RTK) 
inhibitors have all been proposed for use in interfering with VEGF signaling (Siemeister 
et al., Cancer Metastasis Rev., 17(2):241-248., 1998). In fact, monoclonal antibodies 
against VEGF have been shown to inhibit human tumor xenograft growth and ascites 

20 formation in mice (Kim et al., Growth Factors 7:53 (1992); Nature 362:841-844 (1993); 
Asano et al., Hybridoma, 17:185-90, (1998); Mesiano et al., Am. J. Pathol., 153(4):1249- 
1256, (1998); Luo et al, Cancer Res., 58(12):2594-2600, (1998); Cancer Res., 
58(12):2652-2660, (1998); Borgstrom et al. Cancer Res. 56:4032-4039 (1996); 
Borgstrom et al., Prostate, 35(1): 1-10, (1998)). Furthermore, the same strategy using 

25 anti-angiogenic molecules, anti-VEGF antibody, and VEGF antagonists have been 

utilized in experimental treatment of AMD and diabetic retinopathy (Adamis et al. Arch. 
Ophthalmol. 1 14:66-71 (1996)). For the therapeutic applications, antibodies are 
generally engineered to reduce their toxicities in repeated dosage by humanization, if 
they are derived originally from mouse and to improve other attributes such as binding 

30 affinity with the target molecules by affinity maturation (Winter and Milstein, Nature, 
349:293-299, (1991); Baca et al, J. Biol. Chem., 272(16):10678-84, (1997); Presta, et al, 
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Cancer Res., 57:4593-4599, (1997); Chen et al. (1999) J. Mol. Biol. 293:865-881; and 
Ryan et al. (1999) Toxicologic Pathology, 27(l):78-86). 

Although the foregoing studies underscore the importance of VEGF in solid 
tumor growth, and its potential as a target for tumor therapy, the identification of 
additional agents that inhibit VEGF-induced angiogenesis would be of benefit in 
expanding the number of therapeutic options. The development of therapeutic agents that 
specifically inhibit VEGF to bind with its receptor represents important alternatives to 
target angiogenesis more effective with potentially improved therapeutic benefits. 
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SUMMARY OF THE INVENTION 

The present invention provides an innovative methodology for engineering 
proteins with desired structures and functions, especially for engineering antibodies with 
5 desirable properties from a therapeutic perspective, including high binding affinity for the 
target antigen, ability to effectively inhibit abnormal cell proliferation in vitro and in 
vivo, and minimal toxicity or side effects. 

In one aspect of the invention, methods are provided for designing and selecting 
antibodies against human antigens with high affinity and specificity in silico and in vitro. 
10 In some particular embodiments, methods are provided for designing and selecting 

humanized or fully human antibodies against vascular endothelial growth factor (VEGF) 
with high affinity and specificity. 

In another aspect of the invention, monoclonal antibodies against VEGF are 
provided. In particular, humanized or human anti-VEGF monoclonal antibodies are 
15 provided that can bind to human VEGF with high affinity. Preferably, these antibodies 
can inhibit VEGF-induced proliferation of endothelial cells in vitro and inhibit VEGF- 
induced angiogenesis in vivo. These antibodies and their derivative can be used in a wide 
variety of applications such as diagnosis, prevention, and treatment of diseases such as 
cancer, AMD, diabetic retinopathy, and other diseases derived from pathological 
20 angiogenesis. 

In one embodiment, a monoclonal antibody is provided that specifically binds to a 
human VEGF with dissociation constant K<j equal to or lower than 0.2 nM, optionally 
lower than 0.1 nM, optionally lower than 0.08 nM, optionally lower than 0.05 nM, 
optionally lower than 0.01 nM, or optionally lower than 0.005 nM, in the form of scFv, 
25 Fab, or other form of antibody measured at a temperature of about 4°C, 25°C, 37°C or 
42°C. 

Preferably, Ka of the anti-VEGF antibody, if in the form of scFv and measured at 
a temperature of 35-37°C, is lower than 1 nM, optionally lower than 0.8 nM, optionally 
lower than 0.5 nM, optionally lower than 0.2 nM, optionally lower than 0.1 nM, 
30 optionally lower than 0.08 nM, optionally lower than 0.05 nM, optionally lower than 0.01 
nM, or optionally lower than 0.005 nM. 

-5- 
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Also preferably, of the anti-VEGF antibody, if in the form of Fab and 
measured at a temperature of 35-37°C, is lower than 0.5 nM, optionally lower than 0.2 
nM, optionally lower than 0.1 nM, optionally lower than 0.08 nM, optionally lower than 
0.05 nM, optionally lower than 0.01 nM, or optionally lower than 0.005 nM. 
5 In inhibition of VEGF-induced proliferation of endothelial cells in vitro, the the 

anti-VEGF antibody preferably has an effective dose for inhibition of 50% cell 
proliferation ED 50 equal to or lower than 10 nM, optionally lower than 5 nM, optionally 
lower than 1 nM, optionally lower than 0.5 nM, optionally lower than 0.1 nM, optionally 
lower than 0.05 nM, or optionally lower than 0.01 nM in the form of scFv, Fab or other 
10 form of antibody. 

In a preferred embodiment, a monoclonal antibody is provided that specifically 
binds to a human VEGF and has V L comprising the amino acid sequence of 
Xi X2X3X4TQX5PSX6X7SX8X9X1 qGXi 1 Xi 2X1 3X1 4 IX] 5CX1 ^Xi 7SX1 aXi q IX? qX? 1 X??X?iX?a 

WQQX25PGX26APX27X 28 LX29Y2ko& 
15 X40LTIX4lX42LQX43X 4 4^^ 

wherein the underlined regions are designated as Vl/CDRI, VVCDR2, and VVCDR3, 
respectively, whereas the rest of the region is designated as framework, and wherein Xi is 
D, E or A; X 2 is I, or T; X 3 is V, E, K, R, Q, or T; X4 is M, or L; X 5 is S, or T, X 6 is S, 
or T; X 7 is L, or V; X 8 is A, or V; X 9 is S, or T; X 10 is P, V, L, A, or I; Xi 1 is E, or D; X !2 

20 is R, or T; X 13 is A, or V I; X 14 is T, or A; X ]5 is T, S, or A; X 16 is S, R, N, K, H, or Q; 
X 17 is A, or S; X 18 is Q, or R; X, 9 is S, D, A, or P; X 20 is S, G, R, T, or Y; X 2 i is T, N, S, 
D, or K; X 22 is Y, or D; X 23 is L, or I; X 24 is A, N, or T; X 25 is K, or I; X 26 is Q, K, T, or 
I; X 27 is R, K, Q, N, H, S, or E; X 28 is V, or L; X 29 is I, or V; X 30 is F, A, G , D, or S; 
X 3 , is A , or T; X 32 is S, or T; X 33 is N, S, R, or T; X 34 is A, H, or Q; X 35 is S , or G; X 36 

25 is P, T; X 37 is S, N, D, G, or Y; X 38 is S, or T; X 39 is G, or R; X40 is T, or A; X4, is S, or 
R; X42 is S, or R; X4 3 is P, or A; X44 is E , or D; X45 is F, V, or S; X46 is V, T, I , A , or S; 
X47 is Y, or S; X4 8 is S, Y, or N; X4 9 is S , or T; X 50 is T, V, A, P, K, G, S, or I; X51 is W , 
or Y; X 52 is Q , or G; X 53 is V, or L; and X54 is E, D , or A. 

Such preferred V L sequences may be combined with the preferred V H sequences 

30 or V H of other antibodies, provided that the antibody so produced binds to the human 
VEGF with a desired affinity. 

-6- 
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In another preferred embodiment, a monoclonal antibody is provided that 

specifically binds to a human VEGF and has V L comprising the amino acid sequence of 

XiX2X^LTQPPSX4SX5TPGOX6VTISCSGX 7 X R SNX g GX inNXiiVXnWYOOXnPGXi4A 
PKXi5LX,6YX 17 Mi8Xi9BPSGVPX 

5 CX^X7^ WDDSLX7 7 GYV FGX ? rGTX9qLTVL. wherein the underlined regions are 

designated as V^CDRl, Vi7CDR2, and Vl/CDR3, respectively, whereas the rest of the 
region is designated as framework, and wherein Xi is Q L, or N; X 2 is P A F, or S; X 3 is 
V, or M; X4 is A, or T; X 5 is G, or A; X 6 is R, or S; X 7 is S, or T; X 8 is S, T Y, or N; X 9 is 
I , or V; X10 is S, or R; Xi 1 is S, P, N, A, or T; X 12 is N, T , or Y; X, 3 is L, or F; X 14 is T, 

10 or A; X15 is V, L, or F; X 16 is M, or I; X 17 is G, T, or S; X 18 is N, or D; X19 is Q, or E; X 20 
is D, or E; X 2 i is F, or L; X 22 is K, or R; X 23 is G, or A; X 24 is Q, L, or R; X 25 is A, or G; 
X 26 is A, S, or T; X 27 is N, S, or T; X 28 is T, or A; and X 29 is K, or Q. 
/ Such preferred V L sequences may be combined with the preferred V H sequences 

or V H of other antibodies, provided that the antibody so produced binds to the human 

1 5 VEGF with a desired affinity. 

In yet another preferred embodiment, a monoclonal antibody is provided that 
specifically binds to a human VEGF and has V L comprising the amino acid sequence of 
QSALTQPPSVSGAPGQRVTISCTGRSSNIGAGHDVHWYQQLPGTAPKLLIYANDQ 
RPSGVPDRFSDSKSGTSASLGISGLRSEDEADYFCATWDDSLHGYVFGTGTKVTV 

20 L (SEQ ID No: 54). This V L sequence may be combined with the preferred V H 

sequences or V H of other antibodies, provided that the antibody so produced binds to the 
human VEGF with a desired affinity. 

In yet another preferred embodiment, a monoclonal antibody is provided that 
specifically binds to a human VEGF and has Vh comprising the amino acid sequence of 

25 X,X 2 QLVX 3 SGGGX4VQPGGX5LRLX6CAX 7 

GKGXi 6 EWVGWXi 7NTX1 8 Xi qGX 7 oX? , TYX, 7X^X, 4 FX^RRX,.TX,7SX^X,oX,nSKX, 
iX 32 X 33 YLQX 34 NSLRAEDTAV^ 

SS, wherein the underlined regions are designated as CDR1, CDR2, and CDR3, 
respectively, whereas the rest of the region is designated as framework according to 
30 Kabat nomenclature, and wherein Xi is E, or Q; X 2 is V, or G; X 3 is Q, or E; X4 is V, or 
L; X 5 is S, or T; X 6 is S T, or R; X 7 is A, or V; X 8 is Y, or F; X 9 is T, D, N, S, or A; X 10 is 
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F, or L; Xi i is T, D, Y, A, S, or N; Xi 2 is N, H, or S; X n is Y, or F; X, 4 is M, L, I, or V; 

X15 is I, V, or L; Xi 6 is L, or P; X n is I, or V; Xi 8 is Y, or N; Xi 9 is T, or N; X 2 o is E, or 

A; X21 is P, T, or S; X22 is A, or V; X23 is A, H, Q, P, D, or E; X24 is D, or E; X25 is K, or 

T; X26 is V, F, or L; X27 is F, or I; X 2 g is L, or R; X29 is D, or N; X30 is T, or N; X31 is S, 
5 or N; X 32 is T, Q, P, or K; X 33 is A, V, or P; X 34 is L, or M; X 35 is K, or R; X 36 is H, or Y; 

X 37 is S, R, or T; and X 3 g is G, or A. 

Such preferred V H sequences may be combined with the preferred V L sequences 

or V L of other antibodies, provided that the antibody so produced binds to the human 

VEGF with a desired affinity. 
10 In one embodiment, a monoclonal antibody is provided that specifically binds to a 

human VEGF and has V L comprising the amino acid sequence selected from the group 

consisting of SEQ ID NOs:2-54, more preferably comprising the amino acid sequence 

selected from the group consisting of SEQ ID NO: 14, SEQ ID NO:26, SEQ ID NO:28, 

SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:44, SEQ ID NO:47, and SEQ ID NO:54. 
15 Such preferred V L sequences may be combined with the preferred V H sequences 

or V H of other antibodies, provided that the antibody so produced binds to the human 

VEGF with a desired affinity. 

In one embodiment, a monoclonal antibody is provided that specifically binds to a 

human VEGF and has V H comprising the amino acid sequence selected from the group 
20 consisting of SEQ ID NOs:57-l 10 and SEQ ID NOs:285-310, and preferably comprising 

the amino acid sequence selected from the group consisting of SEQ ID NOs:61-64, SEQ 

ID NO:67, 68, 70, 75, 83, 88, 89, 90, 91, 92, 93, 94, and 96-110. 

Such preferred V H sequences may be combined with the preferred V L sequences 

or V L of other antibodies, provided that the antibody so produced binds to the human 
25 VEGF with a desired affinity. 

In yet another embodiment, a monoclonal antibody is provided that specifically 

binds to a human VEGF and has CDR1 in the V L region (V L /CDR1) comprising the 

amino acid sequence selected from the group consisting of SEQ ID NOs: 164-194. 

Such preferred V L /CDR1 may be combined with other regions of preferred light 
30 chain, or of other light chain, and the preferred heavy chain variable region sequence or 

-8- 
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with other heavy chain variable region sequences, provided that the antibody so produced 
binds to human VEGF with desired affinity. 

In yet another embodiment, a monoclonal antibody is provided that specifically 
binds to a human VEGF and has CDR2 in the V L region (V L /CDR2) comprising the 
5 amino acid sequence selected from the group consisting of SEQ ID NOs: 195-209. 

Such preferred Vl/CDR2 may be combined with other regions of preferred light 
chain, or of other light chain, and the preferred heavy chain variable region sequence or 
with other heavy chain variable region sequence, provided that the antibody so produced 
binds human VEGF. 

10 In yet another embodiment, a monoclonal antibody is provided that specifically 

binds to a human VEGF and has CDR3 in the V L region (V L /CDR3) comprising the 
amino acid sequence selected from the group consisting of SEQ ED NOs:21 0-228. 

Such preferred V L /CDR3 may be combined with other regions of preferred light 
chain, or of other light chain, and the preferred heavy chain variable region sequence or 

15 with other heavy chain variable region sequence, provided that the antibody so produced 
binds to human VEGF with desired affinity. 

In yet another embodiment, a monoclonal antibody is provided that specifically 
binds to a human VEGF and has a framework region (FR) CDR3 in the Vl region 
(Vl/FR) comprising the amino acid sequence selected from the group consisting of: SEQ 

20 ED NO:229-269, and preferably comprising the amino acid sequence selected from the 
group consisting of SEQ ED NO:232, 235, 237, 251, 255, 263, and 265. 

Such preferred Vl/FR may be combined with CDR regions of preferred light 
chain, or of other light chain, and the preferred heavy chain variable region sequence or 
with other heavy chain variable region sequence, provided that the antibody so produced 

25 binds to human VEGF with desired affinity. 

In one embodiment, a monoclonal antibody is provided that specifically binds to a 
human VEGF and has CDR1 in the V H region (V H /CDR1) comprising the amino acid 
sequence of GX 1 X 2 X3X4X5X 6 GX 7 N, wherein Xi is Y, or F; X 2 is D, N, T, S, or A; X 3 is 
F,orL;X4isT, D, S, Y, A,orN;X 5 isH, N, or S; X 6 is Y, or F; X 7 is M, L, I,orV. 

30 Further preferably V H /CDR1 of the monoclonal antibody comprises the amino 

acid sequence selected from the group consisting of: SEQ ED NOs:l 1 1-135. 

-9- 

2369565 6.DOC 



> PATENT 

Attorney Docket No: 26050-709.501 

Such preferred Vh /CDR1 may be combined with other regions of preferred heavy 
chain or of other heavy chain, and the preferred light chain variable region sequence or 
with other light chain variable region sequence, provided that the antibody so produced 
binds human VEGF. 

5 In one embodiment, a monoclonal antibody is provided that specifically binds to a 

human VEGF and has CDR2 in the V H region (V H /CDR2) comprising the amino acid 
sequence of WX 1 NTX 2 X3GEX4TYX5X 6 X 7 FX 8 R, wherein X, is I, or V; X 2 is Y, or N; X 3 
is T, or N; X4 is P, T, or S; X 5 is A, or V; X 6 is A, Q, P, H, D, or E; X 7 is D, or E; and X 8 
is K, or T 

10 Preferably V H /CDR2 of the monoclonal antibody comprises the amino acid 

sequence selected from the group consisting of: SEQ ID NOs: 136-156. 

Such preferred V H /CDR2 may be combined with other regions of preferred heavy 
chain or of other heavy chain, and the preferred light chain variable region sequence or 
with other light chain variable region sequence, provided that the antibody so produced 
1 5 binds to human VEGF with desired affinity. 

In one embodiment, a monoclonal antibody is provided that specifically binds to a 
human VEGF and has CDR3 in the V H region (V H /CDR3) comprising the amino acid 
sequence of KYPXi YYGX 2 SHWYFDV, wherein X, is Y, or H, and X 2 is R. 

Preferably, the anti-VEGF antibody V H /CDR3 has the amino acid sequence 
20 selected from the group consisting of SEQ ID NOs:3 1 1-337. 

Such preferred V H /CDR3 may be combined with other regions of preferred heavy 
chain or of other heavy chain, and the preferred light chain variable region sequences or 
with other light chain variable domain sequences, provided that the antibody so produced 
binds to human VEGF with desired affinity. 
25 In one embodiment, a monoclonal antibody is provided that specifically binds to a 

human VEGF and has FR in the V H region (V H /FR) comprising the amino acid sequence 
of 

X,VQLVX 2 SGGGX3VQPGGX4LRLX5CAX 6 S/CDR1/WX 7 RQAPGKGLEWVG/CDR2/ 
RX 8 TX 9 SX 1 oDX 11 SKX 12 Xi3X 1 4YLQX 15 NSLRAEDTAVYYCA/CDR3AVX, 6 QGTLVTV 
30 SS, wherein Xj is E, or Q; X 2 is Q, or E; X 3 is V, or L; X4 is S, or T; X 5 is S, T, or R; X 6 
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is A, or V; X 7 is I, or V; X 8 is F, or V; X 9 is F, or I; Xio is L, or R is Xi i is T, or N; X12 is 
S, or N; X !3 is T, Q, or K; X H is A, or V; Xj 5 is M, or L; and X !6 is G, or A. 

In one embodiment, a monoclonal antibody is provided that specifically binds to a 
human VEGF and has a Vl and V H pair selected from the group consisting of: SEQ ID 
5 NO:l and 70; SEQ ID NO:l and 67; SEQ ID NO:l and 75; SEQ ID NO:l and 83; SEQ 
ID NO:14 and 55; SEQ ID NO:l and 101; SEQ ID NO:l and 100; SEQ ID NO:14 and 
102; SEQ ID NO:l and 103; SEQ ID NO:l and:104; SEQ ID NO:l and 105; SEQ ID 
NO:36 and 100; SEQ ID NO:26 and 100; SEQ ID NO:28 and 100; SEQ ID NO:37 and 
100; SEQ ID NO:44 and 100; SEQ ID NO:54 and 100; and SEQ ID NO:47 and 100, 

10 preferably selected from the group consisting of SEQ ID NO:28 and 61; SEQ ID NO:28 
and 62; SEQ ID NO:28 and 63; SEQ ID NO:28 and 64; SEQ ID NO:28 and 68; SEQ ID 
NO:28 and 85; SEQ ID NO:28 and 86; SEQ ID NO:28 and 87; SEQ ID NO:28 and 88; 
SEQ ID NO:28 and 89; SEQ ID NO:28 and 90; SEQ ID NO:28 and 91; SEQ ID NO:28 
and 92; SEQ ID NO:28 and 93; SEQ ID NO:28 and 94; SEQ ID NO:28 and 95; SEQ ID 

15 NO:28 and 96; SEQ ID NO:28 and 97; SEQ ID NO:28 and 98; SEQ ID NO:28 and 99; 
SEQ ID NO:28 and 106; SEQ ID NO:28 and 107; SEQ ID NO:28 and 108; SEQ ID 
NO:28 and 109; and SEQ ID NO:28 and 1 10. 

The binding affinity of the above-described antibody to a human VEGF, 
represented by dissociation constant K<j, is optionally lower than 100 nM, optionally 

20 lower than 10 nM, optionally lower than 8 nM, optionally lower than 5 nM, optionally 
lower than 1 nM, optionally lower than 0.8 nM, optionally lower than 0.5 nM, optionally 
lower than 0.2 nM, optionally lower than 0.1 nM, optionally lower than 0.08 nM, 
optionally lower than 0.05 nM, optionally lower than 0.01 nM, or optionally lower than 
0.005 nM, in the form of scFv, Fab, or other form of antibody measured at a temperature 

25 of about 4°C, 25°C, 37°C or 42°C. 

The antibodies resulted from combination of the full-length V H and Vl, Vh/CDR, 
V H /FR, Vl/CDR, VlTFR (e.g., the hit variants provided in Figures 1 A and IB and amino 
acid sequences shown in Figures 1C and ID) disclosed in the present invention are within 
the scope of the present invention; and such combination does not include the anti-VEGF 

30 antibodies disclosed in US Patent application serial No: 09/056,160, publication No: 
2002/0032315. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1A shows an AA-PVP profile of V L hit variants designed by using the 
5 inventive methodology in silico. 

Figure IB shows an AA-PVP profile of V H hit variants designed by using the 
inventive methodology in silico. 

10 Figure 1C shows amino acid sequences of full-length V L , Vi7CDR, and Vl/FR of 

certain embodiments of the antibodies according to the present invention. 

Figure ID shows amino acid sequences of full-length V H , V H /CDR, and V H /FR of 
certain embodiments of the antibodies according to the present invention. 

15 

Figures 2 A and 2B show the affinity analysis of a panel of variants of humanized 
anti-VEGF antibody using BioCore biosensor at 25°C. Antibody fragments in the format 
of single chain were expressed as described below and purified. The measurement was 
done by measuring the change of SPR units (y-axis) vs time (x-axis) when a purified 

20 antibody binds its antigen (VEGF) immobilized on the CMS biochip at 25°C. Kd values 
were calculated by the ratio of Ko n to Koff. A ratio of a variant Kd to that of hABl for 
each variant antibody was also calculated. is defined as association rate constant 
expressed as the reciprocal of the multiple of concentration of the antibody fragment (in 
molar) and time (in second); whereas Koff is defined as dissociation rate constant 

25 expressed as the reciprocal of time (in second); Kd is defined as dissociation constant, 
which is the ratio of Koff to Ko n expressed in molar concentration. hABl contains SEQ1 
and SEQ55 (which, if in the form of Fab, would be essentially Fab- 12 as described in 
Chen et al. (1999) J. Mol. Biol. 293:865-881); hAB2 contains SEQ1 and SEQ70; hAB3 
contains SEQ1 and SEQ67; hAB4 contains SEQ1 and SEQ83; hAB5 SEQ1 and SEQ75; 

30 hAB9 contains SEQ14 and SEQ102; hABlO contains SEQ1 and SEQ103; hABl 1 
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contains SEQ1 and SEQ104; hAB12 contains SEQ1 and SEQ105, as V L and V H , 
respectively. 



Figure 2C shows the affinity analysis of a panel of variants of humanized anti- 
5 VEGF antibody using BioCore biosensor at 35°C. Antibody fragments in the format of 
single chain were expressed as described below and purified. The measurement was 
done by measuring the change of SPR units (y-axis) vs time (x-axis) when a purified 
antibody binds its antigen (VEGF) immobilized on the CMS biochip at 35°C. K<i values 
were calculated by the ratio of to Koff- A ratio of a variant Kd to that of hABl for 

10 each variant antibody was also calculated. hABl contains SEQ1 and SEQ55 (which, if in 
the form of Fab, would be essentially Fab-12 as described in Chen et al. (1999) J. Mol. 
Biol. 293:865-881); hAB7 contains SEQ1 and SEQ101; hAB8 contains SEQ1 and 
SEQ100; hAB13 contains SEQ1 and SEQ56 (which, if in the form of Fab, would be 
Y0317 as described in Chen et al. (1999) J. Mol. Biol. 293:865-881); hAB14 contains 

15 SEQ36 and SEQ100; hAB15 contains SEQ26 and SEQ100; hAB16 contains SEQ28 and 
SEQ100; hAB35 contains SEQ28 and SEQ106; hAB36 contains SEQ28 and SEQ107; 
hAB37 contains SEQ28 and SEQ108; hAB38 contains SEQ28 and SEQ109; hAB39 
contains SEQ28 and SEQ1 10, as V L and V H , respectively. 

20 Figure 3 shows the affinity analysis of six humanized anti-VEGF antibodies using 

BIAcore biosensor. The antibodies were incubated in lxPBS buffer at 4°C, 37°C, or 
42°C for 16 hours before the assay. The measurement was done by measuring the change 
of SPR units (y-axis) vs time (x-axis) when a purified antibody binds its antigen (VEGF) 
immobilized on the CMS biochip at 25°C. Both the on-rate and off-rate changes were 

25 determined from the data fitting using 1:1 Langmuir binding model, whereas IQs were 
determined by the ratio of to Ko n . Composition of the V L and V H , respectively, is 
listed as following: hABl contains SEQ1 and SEQ55; hAB2 contains SEQ1 and SEQ70; 
hAB3 contains SEQ1 and SEQ67; hAB7 contains SEQ1 and SEQ101; hAB8 contains 
SEQ1 and SEQ100; hAB13 contains SEQ1 and SEQ56, as V L and V H , respectively. 
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Figure 4 summarized the data of the stability of the humanized anti-VEGF 
antibodies shown in Figure 3. The y-axis shows the percentage of the antibody remain 
active in binding to the immobilized VEGF antigen using BIAcore at 25°C after the 
purified antibody is incubated at 4°C, 37°C and 42°C for 16 hours as described in Figure 
5 3. The maximal binding of each antibody at different conditions was expressed as 
percentile of that of 4°C. 

Figure 5 shows the expression of the humanized anti-VEGF antibodies using 
E.coli expression system as described below. The expression level of each antibody 

10 fragment was evaluated by applying the same volume of the purified material from the 
same fraction as shown detected by SDS-PAGE/Coomassie blue staining. Composition 
of the V L and V H , respectively, is listed as following: hABl contains SEQ1 and SEQ55; 
hAB35 contains SEQ28 and SEQ106; hAB36 contains SEQ28 and SEQ107; hAB37 
contains SEQ28 and SEQ108; hAB38 contains SEQ28 and SEQ109; hAB39 contains 

15 SEQ28 and SEQ1 10, as V L and V H , respectively. 

Figure 6 shows expression of humanized anti-VEGF antibody in a eukaryotic 
system as described below. The expression level of each antibody fragment was 
evaluated by applying the same volume of the purified material from the same fraction as 
20 shown detected by SDS-PAGE/Coomassie blue staining. Composition of the VL and 
VH, respectively, is listed as following: hABl contains SEQ1 and SEQ55; hAB2 
contains SEQ1 and SEQ70; hAB3 contains SEQ1 and SEQ67; hAB5 contains SEQ1 and 
SEQ75; hAB7 contains SEQ1 and SEQ101; hAB8 contains SEQ1 and SEQ100; hAB13 
contains SEQ1 and SEQ56, as V L and V H , respectively. 

25 

Figure 7 illustrates a map of the vector used for expression of soluble antibody 
fragments in E. coli. 

DETAILED DESCRIPTION OF THE INVENTION 

30 
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The present invention provides an innovative methodology for engineering 
proteins, in particular antibodies, with desired structures and functions. In one aspect, 
methods are provided for designing and selecting antibodies against human antigens with 
high affinity and specificity. In some particular embodiments, methods are provided for 
5 designing and selecting humanized or fully human antibodies against vascular endothelial 
growth factor (VEGF) with high affinity and specificity. In another aspect, compositions, 
kits and methods are provided for using these antibodies and their derivatives to inhibit 
angiogenesis in vitro, and for diagnosing or treating diseases associated with abnormal 
angiogenesis such as cancer, rheumatoid arthritis, ischemic-reperfusion related brain 
10 edema and injury, cortical ischemia, ovarian hyperplasia and hypervascularity, 

endometriosis, psoriasis, diabetic retinopaphy, and other ocular angiogenic diseases. 

1. VEGF and Antibodies Against VEGF 

15 VEGF is a key angiogenic factor in development and is involved in the growth of 

solid tumor by stimulating endothelial cells. A murine monoclonal antibody was found 
to block VEGF-dependent cell proliferation and slow the tumor growth in vivo (Kim KJ, 
Li B, Winer J, Armanini M, Gillett N, Phillips HS, Ferrara N (1993) Nature 362, 841- 
844). This murine antibody was humanized (Presta LG, Chen H, O'Connor SJ, Chisholm 

20 V, Meng YG, Krummen L, Winkler M, Ferrara N (1997) Cancer Res. 57, 4593-4599; 
Baca M, Presta LG, O'Connor SJ, Wells JA (1997) J Biol Chem 272, 10678-10684) and 
affinity-matured by using phage-display and off-rate selection (Chen Y, Wiesmann C, 
Fuh G, Li B, Christinger HW, McKay P, de Vos AM (1999) J Mol Biol 293, 865-881). 
X-ray structure for the complex formed between VEGF and the parental antibody was 

25 reported (Muller YA, Chen Y, Christinger HW, Li B, Cunningham, BC, Lowman HB, de 
Vos AM (1998) Structure 6, 1 153-1 167.), as well as the one formed between VEGF and 
the matured antibody (Chen Y, Wiesmann C, Fuh G, Li B, Christinger HW, McKay P, de 
Vos AM, Lowman HB (1999) J. Mol Biol 293, 865-881). In addition, US Patent 
application serial No: 09/056,160, publication No: 2002/0032315, discloses certain anti- 

30 VEGF antibodies. These publications on VEGF and anti-VEGF antibodies are herein 
incorporated by reference in their entirety. 
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The present invention provides novel anti-VEGF antibodies that bind to human 
VEGF with high affinity and specificity. These anti-VEGF antibodies are humanized and 
optimized with some important attributes in binding affinity, stability, expression 
efficiency which are desirable for research, diagnostic and therapeutic applications. 

5 The binding affinity of the selected anti-VEGF antibodies of the present invention 

to a human VEGF, represented by dissociation constant IQ, is optionally lower than 100 
nM, optionally lower than 10 nM, optionally lower than 8 nM, optionally lower than 8 
nM, optionally lower than 5 nM, optionally lower than 1 nM, optionally lower than 0.8 
nM, optionally lower than 0.5 nM, optionally lower than 0.2 nM, optionally lower than 

10 0.1 nM, optionally lower than 0.08 nM, optionally lower than 0.05 nM, optionally lower 
than 0.01 nM, or optionally lower than 0.005 nM, in the form of scFv, Fab, or other form 
of antibody measured at a temperature of about 4°C, 25°C, 37°C or 42°C 

Preferably, Kd of the selected anti-VEGF antibodies of the present invention, if in 
the form of scFv and measured at a temperature of 35-37°C, is lower than 1 nM, 

15 optionally lower than 0.8 nM, optionally lower than 0.5 nM, optionally lower than 0.2 
nM, optionally lower than 0.1 nM, optionally lower than 0.08 nM, optionally lower than 
0.05 nM, optionally lower than 0.01 nM, or optionally lower than 0.005 nM. 

Also preferably, K^of the selected anti-VEGF antibodies of the present invention, 
if in the form of Fab and measured at a temperature of 35-37°C, is lower than 0.5 nM, 

20 optionally lower than 0.2 nM, optionally lower than 0.1 nM, optionally lower than 0.08 
nM, optionally lower than 0.05 nM, optionally lower than 0.01 nM, or optionally lower 
than 0.005 nM. 

In inhibition of VEGF-induced proliferation of endothelial cells in vitro the 
selected anti-VEGF antibodies of the present invention preferably has an ED 5 o (effective 
25 dose for inhibition of 50% cell proliferation) equal to or lower than 10 nM, optionally 
lower than 5 nM, optionally lower than 1 nM, optionally lower than 0.5 nM, optionally 
lower than 0.1 nM, optionally lower than 0.05 nM, or optionally lower than 0.01 nM in 
the form of scFv, Fab or other form of antibody. 

In a preferred embodiment, an anti-VEGF antibody is provided that has a light 
30 chain variable region comprising the amino acid sequence of: 



- 16- 

2369565 6.DOC 



PATENT 

Attorney Docket No: 26050-709.501 
Xi X^X^X^TQXsPSX^XtSXrXqXi nGXi i Xi ?Xi 1X14IX1 sCXi fiXi 7SX1 aXi q DG qX? 1 X22X21X24 

OTQQX25PGX 26 APX 2 7X 28 LX 29 YX3oX3lX32^ 

X^LTIX^X^LOX^X^^ 

wherein the underlined regions are designated as Vl/CDRI, VVCDR2, and Vi7CDR3, 
5 respectively, whereas the rest of the region is designated as framework, and wherein the 
position designated as "X" could be amino acids listed below: 





Xi * 


D E or A 




X 2 : 


I, orT 




Xt • 


V E K R O orT 


10 


Xa * 


M, or L 




X* ■ 


S orT 




x* • 


S, orT 




Xt ■ 


L, or V 






A, or V 


15 


Xq ' 


S, orT 




✓Via 


• P V L A or I 




Xll 


E, or D 




yV 1 "> 


: R, or T 




Xn 


. A, or V I 


20 


X14 


T or A 




Xis 


T S or A 




Xi* 


S R N K H orO 




Xl7 


A or S 




Xl8 


Q, or R 


25 


X19 


S D A orP 




X 2 0 


S G R T orY 




X?1 


T N S D orK 




X 22 


V orD 




x<^ 

^23 


L orT 


30 


X 2 4 


A, N, orT 




X 25 


K, or I 




X26 


Q, K, T,or I 




X27 


R, K, Q, N, H, S, or E 




X28 


V, orL 


35 


X29 


I, orV 




X30 


F, A, G , D, or S 




X 3 1 


A , or T 




X 32 


S, orT 




X33 


N, S, R, or T 


40 


X34 : 


A, H, or Q 




X 35 : 


S ,orG 




X 3 6 • 


P,T 




X 37 : 


S, N, D, G, or Y 



-17- 

2369565 6.DOC 



PATENT 

Attorney Docket No: 26050-709.501 

X 38 :S,orT 
X 39 : G, or R 
X4 0 :T, or A 
X41 : S, or R 
5 X42 : S, or R 
X43 : P, or A 
X44 : E , or D 
X4 5 :F,V,orS 
X46 : V, T, I , A , or S 
10 X4 7 :Y,orS 
X4 8 :S, Y, orN 
X49 : S , or T 

X 50 : T, V, A, P, K, G, S, or I 
X 5 , : W,orY 
15 X 5 2.:Q,orG 
X 53 :V,orL 
X54: E, D , or A. 

Such preferred light chain variable domain sequences may be combined with the 
preferred heavy chain variable domain sequences, or with other heavy chain variable 
20 domain sequences, provided that the antibody so produced binds to human VEGF with a 
desired affinity. 

In another preferred embodiment, an anti-VEGF antibody is provided that has a 
light chain variable region comprising the amino acid sequence of: 
XiXzXjLTQPPSXtSXsTPGOXfiVTISCSGXvXsSNXQGXinNXnVX^WYQQX^PGX,^ 
25 PKX I 5LX l6 YXi7Mi8X,9RPSGVPX 2 oRX 21 SGSX22SX 23 TSASLAISGLX24SEDEADYY 
CX^X^ WDDSLX^GYVF GX^GTX^LTVL. wherein the underlined regions are 
designated as Vl/CDRI, VJCDR2, and Vi/CDR3, respectively, whereas the rest of the 
region is designated as framework, and wherein the position designated as "X" could be 
amino acids listed below: 

30 X,:QL,orN 

X 2 :PAF,orS 

X 3 : V, or M 

X4 : A, or T 

X 5 : G, or A 
35 X 6 : R, or S 

X 7 : S, or T 

X 8 :S,TY,orN 

X 9 : 1 , or V 

Xi 0 :S, orR 
40 X n :S,P,N,A,orT 
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•NT or Y 


-'Mi 


: L, or F 


■* *■ I *♦ 


: T, or A 




: V, L, or F 


-'Mo 


: M, or I 


✓V 1 ^ 


• G T or S 


/V i o 


: N, or D . 


^Vl Q 

-* M y 


: Q, or E 




: D, or E 


X?i 


: F, or L 




: K, or R 




: G, or A 




: Q, L, or R 


x 25 


: A, or G 


X26 


: A, S, or T 


X27 


:N, S, orT 


X28 


: T, or A 


X29 


: K, or Q. 



20 Such preferred light chain variable domain sequences may be combined with the 

preferred heavy chain variable domain sequences, or with other heavy chain variable 
domain sequences, provided that the antibody so produced binds to human VEGF with 
desired affinity. 

In yet another preferred embodiment, an anti-VEGF antibody is provided that has 
25 a light chain variable region comprising the amino acid sequence of: 

QSALTQPPSVSGAPGQRVTISCTGRSSNIGAGHDVHWYQQLPGTAPKLLIYANDQ 
RPSGVPDRFSDSKSGTSASLGISGLRSEDEADYFCATWDDSLHGYVFGTGTKVTV 
L (SEQ ID NO:54), such preferred light chain variable domain sequences may be 
combined with the heavy chain variable domain sequences, or with other heavy chain 
30 variable domain sequences, provided that the antibody so produced binds to human 
VEGF with desired affinity. 

In yet another preferred embodiment, an anti-VEGF antibody is provided that has 
a heavy chain variable region comprising the amino acid sequence of: 
XiX2QLVX3SGGGX4VOPGGX^LRLX6CAX7SGXRXQXioXiiXi9XnGXi4NWXisROAP 
35 GKGXi *EWVGWXi 7 NTX, 8 Xi 9GX20X2 1 TYX^ ^X^X^FX^RRX^TX^SX^X.oX^SKX, 
tX^X^YLQX^NSLRAEDTAVYYCAX^ YPX^YYGX^SHWYFDV WX^OGTLVTV 
SS, wherein the underlined regions are designated as CDR1, CDR2, and CDR3, 
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respectively, whereas the rest of the region is designated as framework according to 
Kabat nomenclature, and wherein the position designated as "X" could be amino acids 
listed below: 





Xi : 


E, or Q 


5 


X 2 : 


V.orG 




X 3 : 


Q, orE 




X4: 


V, orL 




X 5 : 


S, orT 




X 6 : 


S T, or R 


10 


X 7 : 


A, orV 




X 8 : 


Y, orF 




X 9 : 


T, D, N, S, or A 




X10 


: F, or L 




X„ 


: T, D, Y, A, S, or N 


15 


X12 


: N, H, or S 




X 13 


: Y, orF 




X14 


: M, L, I, or V 




X 15 


: I, V, or L 




Xi6 


:L, orP 


20 


X, 7 


:I, orV 




Xi8 


: Y, or N 




X19 


:T, orN 




X20 


: E, or A 




X21 


: P, T, or S 


25 


X22 


: A, orV 




X23 


: A, H, Q, P, D, or E 




X 2 4 


: D, or E 




X25 


:K, orT 




X26 


: V, F, or L 


30 


X27 


: F, or I 




X28 


: L, or R 




X29 


:D,orN 




X30 


:T, orN 




x 3 , 


:S, orN 


35 


X32 


:T, Q, P, orK 




x 33 


: A, V, or P 




X34 


: L, or M 




x 35 


:K,orR 




X36 


:H,orY 


40 


X37 


: S, R, or T 




X38 


: G, or A. 
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In yet another preferred embodiment, an anti-VEGF antibody is provided that has 
a heavy chain variable region comprising the amino acid sequence of: 
XiX7QLVX^SGGGX4VOPGGXsLRLX^CAX7SGXRXQXioXiiXi7XnGXi4NWXisRQAP 
GKGXi sEWVGWXi 7NTX1 8 Xi qGX^X?, TYX^ X^X^^FX^RRX^TX^SX^X^nSKX. 
5 iX 3 2X33YLQX 3 4NSLRAEDTAVYYCAX^^ 

WX4gQGTLVTV 5 wherein the underlined regions are designated as CDR1, CDR2, and 
CDR3, respectively, whereas the rest of the region is designated as framework according 
to Kabat nomenclature, and wherein the position designated as "X" could be amino acids 
listed below: 



10 




F or O 




X 2 : 


V, orG 




X 3 : 


Q, orE 




X4: 


V, orL 




. X 5 : 


S, orT 


15 


X 6 : 


S T, or R 




X 7 : 


A,orV 




X 8 : 


Y, orF 




X 9 : 


T, D, N, S, or A 




X10 


: F, or L 


20 


x„ 


:T,D, Y, A, S, orN 




X12 


: N, H, or S 




X, 3 


: Y, orF 




X14 


: M, L, I, or V 




X15 


: I, V, or L 


25 


Xi6 


: L, or P 




X17 


: I, or V 




Xig 


: Y, orN 




X19 


:T, orN 




X20 


: E, or A 


30 


X21 


: P, T, or S 




X22 


: A, orV 




X 23 


: A, H, Q, P, D, or E 




X24 


:D, orE 




X25 


:K,orT 


35 


X26 


: V, F, or L 




x 27 


: F, or I 




X28 


: L, or R 




X29 


:D, orN 




X30 


:T,orN 


40 


X31 


: S, orN 




X32 


:T, Q, P, orK 
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X33: 


: A, V, or P 


X34 : 


: L, or M 


X35: 


:KR, orH 


X36 : 


: Y A D, or S 


X 37 : P R S, or G 


X38 ', 


:.YH, orD 


X39 : 


: Y, or F 


X40 : 


: YNS, orH 


X41 : 


: G , or S 


X42 : 


: S, T, R, G, or A 


X43 : 


: S, Y, C,orT 


X44 : 


: H, P, C, N, Q, or S 


X45 : 


: W, Q,orC 


X46 : 


: F , or L 


X47 : 


: V, L, or Y 


X48 : 


: G, or A. 



Such preferred heavy chain variable domain sequences may be combined with the 
preferred light chain variable domain sequences or with other light chain variable domain 
20 sequences, provided that the antibody so produced binds to human VEGF with desired 
affinity. 

In one embodiment, the invention provides an anti-VEGF antibody that preferably 
contains a light chain variable domain comprising the amino acid sequences of one of the 
following: SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, 

25 SEQ ED NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:l 1, SEQ ID 
NO:12, SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID 
NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID 
NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID 
NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID 

30 NO:35, SEQ ID NO:38„ SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID 
NO:42, SEQ ID NO:43, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:48, SEQ ID 
NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, and SEQ ID NO:53, further 
preferably SEQ ID NO: 14, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:36, SEQ ID 
NO:37, SEQ ID NO:44, SEQ ID NO:47, or SEQ ID NO:54. Such preferred light chain 

35 variable domain sequences may be combined with the preferred heavy chain variable 
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domain sequences or with other heavy chain variable domain sequences, provided that 
the antibody so produced binds to human VEGF with desired affinity. 

In another embodiment, the invention provides an anti-VEGF antibody that 
preferably contains a heavy chain variable domain comprising the amino acid sequence 
5 of one of the following: SEQ ID NO: ID NO:57, SEQ ID NO: ID NO:58, SEQ ID NO: 
ID NO:59, SEQ ID NO: ID NO:60, SEQ ID NO: ID NO:65, SEQ ID NO: ID NO:66, 
SEQ ED NO: ID NO:69, SEQ ID NO: ID NO:71, SEQ ID NO: ID NO:72, SEQ ID NO: 
ID NO:73, SEQ ID NO: ID NO:74, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, 
SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:84, SEQ 

10 ID NO:85, SEQ ID NO:86, SEQ ID NO:87, further preferably SEQ ID NO:61, or SEQ 
ID NO:62, or SEQ ID NO:63, or SEQ ID NO:64, or SEQ ID NO:67, or SEQ ID NO:68, 
or SEQ ID NO:70, or SEQ ID NO:75, or SEQ ID NO:83, SEQ ID NO:88, or SEQ ID 
NO:89, or SEQ ID NO:90, or SEQ ID NO:91, or SEQ ID NO:92, or SEQ ID NO:93, or 
SEQ ID NO:94, or SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, or 

15 SEQ ID NO:99, SEQ ID NO: 100, or SEQ ID NO: 101, or SEQ ID NO: 102, or SEQ ID 
NO: 103, or SEQ ID NO: 104, or SEQ ID NO: 105, or SEQ ID NO: 106, SEQ ID NO: 107, 
or SEQ ID NO:108, or SEQ ID NO:109, SEQ ID NO:l 10; or one of SEQ ID NOs:285- 
310. Such preferred heavy chain variable region sequence may be combined with the 
preferred light chain variable region sequence or with other light chain variable region 

20 sequence, provided that the antibody so produced binds to Human VEGF with desired 
affinity. 

In yet another embodiment, the invention provides an anti-VEGF antibody that 
preferably contains the CDR1 regions of the light chain variable domain comprising the 
amino acid sequence of one of the following: SEQ ID NO:164, SEQ ID NO:165, SEQ ID 

25 NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID 
NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID 
NO: 176, SEQ ID NO: 177, SEQ ID NO: 178, SEQ ID NO: 179, SEQ ID NO: 180, SEQ ID 
NO: 181, SEQ ID NO: 182, SEQ ID NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID 
NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID 

30 NO: 191, SEQ ID NO: 192, SEQ ID NO: 193, and SEQ ID NO: 194. Such preferred CDR1 
sequence of light chain variable domain may be combined with other regions of preferred 
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light chain, or of other light chain, and the preferred heavy chain variable region 
sequence or with other heavy chain variable region sequence, provided that the antibody 
so produced binds to human VEGF with desired affinity. 

In yet another embodiment, the invention provides an anti-VEGF antibody that 
5 preferably contains the CDR2 regions of the light chain variable domain comprising the 
amino acid sequence of one of the following: SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID 
NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID 
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID 
NO:207, SEQ ID NO:208, and SEQ ID NO:209. Such preferred sequence of light chain 
10 variable domain may be combined with other regions of preferred light chain, or of other 
light chain, and the preferred heavy chain variable region sequence or with other heavy 
chain variable region sequence, provided that the antibody so produced binds human 
VEGF. 

In yet another embodiment, the invention provides an anti-VEGF antibody that 
15 preferably contains the CDR3 regions of the light chain variable domain comprising the 
amino acid sequence of one of the following: SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID 
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
20 NO:227, and SEQ ID NO:228. Such preferred CDR3 sequence of light chain variable 
domain may be combined with other regions of preferred light chain, or of other light 
chain, and the preferred heavy chain variable region sequence or with other heavy chain 
variable region sequence, provided that the antibody so produced binds to human VEGF 
with desired affinity. 

25 In yet another embodiment, the invention provides an anti-VEGF antibody that 

preferably contains the framework regions of the light chain variable domain comprising 
the amino acid sequence of one of the following: SEQ ID NO:229, SEQ ID NO:230, 
SEQ ID NO:231SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:236, SEQ ID NO:238, 
SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, SEQ ID NO:242, SEQ ID NO:243, 

30 SEQ ID NO:244, SEQ ID NO:245, SEQ ID NO:246, SEQ ID NO:247, SEQ ID NO:248, 
SEQ ID NO:249, SEQ ID NO:250, SEQ ID NO:252, SEQ ID NO:253, SEQ ID NO:254, 
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SEQ ID NO:256, SEQ ED NO:257, SEQ ID NO:258, SEQ ID NO:259, SEQ ID NO:260, 

SEQ ID NO:261, SEQ ID NO:262, SEQ ID NO:264, SEQ ID NO:266, SEQ ID NO:267, 

SEQ ID NO:268, and SEQ ID NO:269; further preferably contains the framework 

regions of the light chain variable domain comprising the amino acid sequence of one of 

5 the following: SEQ ID NO:232, SEQ ID NO:235, SEQ ID NO:237, SEQ ID NO:251, 

SEQ ID NO:255, SEQ ID NO:263, and SEQ ID NO:265. Such preferred framework 

region sequence of light chain variable region may be combined with CDR regions of 

preferred light chain, or of other light chain, and the preferred heavy chain variable 

region sequence or with other heavy chain variable region sequence, provided that the 

1 0 antibody so produced binds to human VEGF with desired affinity. 

In one embodiment, the invention provides an anti-VEGF antibody that preferably 

contains the CDR1 regions of the heavy chain variable domain comprising the amino 

acid sequence of one of the following: GX1X2X3X4X5X6GX7N, wherein the position 

designated as "X" could be amino acids listed below: 

15 Xi:Y,orF 

X 2 :D,N, T, S,orA 
X 3 : F, or L 

X,:T, D, S, Y, A, or N 
X 5 : H, N, or S 
20 X 6 :Y,orF 

X 7 :M, L, I , or V. 

Further preferably the CDR1 region of the heavy chain variable domain 
comprises the amino acid sequence of one of the following: SEQ ID NO: 111, SEQ ID 

25 NO: 113, SEQ ID NO: 114, SEQ ID NO: 11 5, SEQ ID NO: 116, SEQ ID NO: 117, SEQ ID 
NO: 118, SEQ ID NO: 11 9, SEQ ID NO: 120, SEQ ID NO: 121, SEQ ID NO: 122, SEQ ID 
NO:123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID 
NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID 
NO: 133, SEQ ID NO: 134, and SEQ ID NO: 135. Such preferred CDR1 sequence of 

30 heavy chain variable domain may be combined with other regions of preferred heavy 
chain or of other heavy chain, and the preferred light chain variable region sequence or 
with other light chain variable region sequence, provided that the antibody so produced 
binds human VEGF. 
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In one embodiment, the invention provides an anti-VEGF antibody that preferably 

contains CDR2 of the heavy chain variable domain comprising the amino acid sequence 

of one of the following: WX i NTX2X3GEX4T YX5X6X7FX8R, wherein the position 

designated as "X" could be amino acids listed below: 

5 X,:I,orV 
X 2 :Y,orN 
X 3 :T, orN 
X4: P, T, or S 
X 5 : A, or V 
10 X 6 : A, Q, P, H, D, or E 
X 7 : D, or E 
X 8 : K, or T 

Further preferably CDR2 of the heavy chain variable domain comprise the amino 

15 acid sequence of one of the following: SEQ ID NO:136,SEQ ID NO: 137, SEQ ID 

NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID 
NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID 
NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID 
NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, and SEQ ID NO: 156. Such preferred CDR2 

20 sequence of heavy chain variable domain may be combined with other regions of 
preferred heavy chain or of other heavy chain, and the preferred light chain variable 
region sequence or with other light chain variable region sequence, provided that the 
antibody so produced binds to human VEGF with desired affinity. 

In one embodiment, the invention provides an anti-VEGF antibody that contains 

25 CDR3 of the heavy chain variable domain comprising the amino acid sequence: 

KYPX1YYGX2SHWYFDV, wherein the position designated as "X" could be amino 
acids listed below: Xi : Y, or H, and X 2 : R. 

Preferably, the anti-VEGF antibody has CDR3 of the heavy chain variable 
domain comprising the amino acid sequence of one of SEQ ID NOs:31 1-337, and the 

30 following sequences: 

HSRHYYGSSPQYFDV 
KYGYYYGSSHWYFDV 
KYPHYYGASHWYFDV 
KYPHYYGGCHWYFDV 
35 KYPHYYGGSHWYFDV 
KYPHYYGGYNQYFDV 
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KYPHYYGRSHWYFDV 
KYPHYYGRSQWYLDV 
KYPHYYSRTCQYFDV 
KYPHYYSSSHWYFDV 
KYPYFYGSSHWYFDV 
KYPYYHGSSHWYFDV 
KYPYYNGSSHWYFDV 
KYPYYNSTSHWYFDV 
KYPYYSGTSHWYFDV 
KYPYYSGTSHWYFDY 
KYPYYYGRSHWYFDV 
KYPYYYGSSHWYFDV 
KYPYYYGSSSWYFDV 
KYPYYYSTSHWYFDV 
KYRDFNGS SHWYFDV 
KYSYYYGSSHWYFDV 
RARHYYGS SHCYFDL 
RDSHYYGSSHQYFDL 
KYPHYYGTSHWYFDV 
KYPHYYGS SHWYFDV 
KYPYYYGTSHWYFDV . 

Such preferred CDR3 sequence of heavy chain variable domain may be combined 
with other regions of preferred heavy chain or of other heavy chain, and the preferred 
25 light chain variable region sequences or with other light chain variable domain 

sequences, provided that the antibody so produced binds to human VEGF with desired 
affinity. 

In one embodiment, the invention provides an anti-VEGF antibody that preferably 
contains the framework region of the heavy chain variable domain comprising the amino 
30 acid sequences of one of the following: 

XiVQLVX 2 SGGGX3VQPGGX4LRLX 5 CAX 6 S/CDRl/WX 7 RQAPGKGLEWVG/CDR2/ 
RX 8 TX9SX 1 oDXiiSKXi2X 13 X,4YLQX,5NSLRAEDTAVYYCA/CDR3/WX, 6 QGTLVTV 
SS, wherein the position designated as "X" could be amino acids listed below: 

Xi : E, or Q 
35 X 2 : Q, or E 

X 3 : V, or L 

Xt : S, or T 

X 5 : S, T, or R 

X 6 : A, or V 
40 X 7 :I,orV 

X 8 :F,orV 
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X 9 : 


F, or I 




L, orR 


Xn 


T, orN 


X, 2 


S, orN 


X 13 


T, Q,orK 


X14 


A, orV 


X 15 


M, or L 


Xi6 


G, or A. 



10 In one embodiment, the invention provides an anti-VEGF antibody that preferably 

contains a light chain variable domain and a heavy chain variable domain comprising the 
amino acid sequences of one of the following V L and V H pairs: SEQ ID NO:l and SEQ 
ID NO:70; SEQ ID NO:l and SEQ ID NO:67; SEQ ID NO:l and SEQ ID NO:75; SEQ 
ID NO:l and SEQ ID NO:83; SEQ ID NO:14 and SEQ ID NO:55; SEQ ID NO:l and 

15 SEQ ID NO:101; SEQ ID NO:l and SEQ ID NO:100; SEQ ID NO:14 and SEQ ID 

NO: 102; SEQ ID NO:l and SEQ ID NO: 103; SEQ ID NO:l and SEQ ID NO: 104; SEQ 
ID NO:l and SEQ ID NO:105; SEQ ID NO:36 and SEQ ID NO:100; SEQ ID NO:26 and 
SEQ ID NO: 100; SEQ ID NO:28 and SEQ ID NO: 100; SEQ ID NO:37 and SEQ ID 
NO:100; SEQ ID NO:44 and SEQ ID NO:100; SEQ ID NO:54 and SEQ ID NO:100; 

20 SEQ ID NO:47 and SEQ ID NO: 1 00, further preferably SEQ ID NO:28 and SEQ ID 
NO:61; SEQ ID NO:28 and SEQ ID NO:62; SEQ ED NO:28 and SEQ ID NO:63; SEQ 
ID NO:28 and SEQ ID NO:64; SEQ ID NO:28 and SEQ ID NO:68; SEQ ID NO:28 and 
SEQ ID NO:85; SEQ ID NO:28 and SEQ ID NO:86; SEQ ID NO:28 and SEQ ID 
NO:87; SEQ ID NO:28 and SEQ ID NO:88; SEQ ID NO:28 and SEQ ID NO:89; SEQ 

25 ID NO:28 and SEQ ID NO:90; SEQ ID NO:28 and SEQ ID NO:91; SEQ ID NO:28 and 
SEQ ID NO:92; SEQ ID NO:28 and SEQ ID NO:93; SEQ ID NO:28 and SEQ ID 
NO:94; SEQ ID NO:28 and SEQ ID NO:95; SEQ ID NO:28 and SEQ ID NO:96; SEQ 
ID NO:28 and SEQ ID NO:97; SEQ ID NO:28 and SEQ ID NO:98; SEQ ID NO:28 and 
SEQ ID NO:99; SEQ ID NO:28 and SEQ ID NO:106; SEQ ID NO:28 and SEQ ID 

30 NO: 107; SEQ ID NO:28 and SEQ ID NO:108; SEQ ID NO:28 and SEQ ID NO: 109; and 
SEQ ID NO:28 and SEQ ID NO: 1 10. 

The antibodies resulted from combination of the full-length V H and V L , V H /CDR, 
Vh/FR, Vl/CDR, V[/FR (e.g., the hit variants provided in Figures 1 A and IB and amino 
acid sequences shown in Figures 1C and ID) disclosed in the present invention are within 
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the scope of the present invention; and such combination does not include the anti-VEGF 
antibodies disclosed in US Patent application serial No: 09/056,160, publication No: 
2002/0032315. 

5 2. Methodology for Designing and Constructing Humanized or Fully Human 
Anti-VEGF Antibodies 

The antibodies of the present invention are designed using innovative methods 
involving construction and selection of protein libraries in silico and in vitro. The 
following describes some aspects of the methodology. More detailed description of the 
10 methodology appears in U.S. Patent Application Nos: 10/443,134, 10/153,159, 

10/153,176, 10/125,687, and 60/284,407, which are incorporated herein by reference in 
their entirety. 

According to the present invention, an innovative methodology is provided for 
efficiently generating and screening protein libraries for optimized proteins with desirable 

15 biological functions, such as improved binding affinity towards biologically and 

therapeutically important target molecules. The methodology is used to optimize proteins 
by generating novel variants of a protein with enhanced properties. In particular, this 
methodology is used to design libraries for humanization of non-human antibodies and to 
optimize the affinity and other attributes of antibodies. Novel variants of amino acids 

20 and nucleic acids of antibodies are generated with human or human-like sequences while 
their binding affinity, stability, and expression efficiency are improved significantly. 

The inventive process is carried out computationally in a high throughput manner 
by mining the ever-expanding databases of protein sequences of all organisms, especially 
human and by relating their specific sequences or their variants with functional 

25 enhancement such as binding affinity and stability that are tested experimentally. By 
using the inventive methodology, an expanded and yet functionally biased library of 
proteins such as antibodies can be constructed based on computational evaluation of 
extremely diverse protein sequences and functionally relevant structures in silico and 
subsequently tested by experimental screening and selection in vitro or in vivo. 
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In one aspect of the invention, a method is provided for designing and selecting 
protein(s) with desirable fiinction(s). The method is preferably implemented in a 
computer through in silico selection of protein sequences based on the amino acid 
sequence of a target structural/functional motif or domain in a lead protein, herein after 
5 referred to as the "lead sequence". The lead sequence is employed to search databases of 
protein sequences. The choice of the database depends on the specific functional 
requirement of the designed motifs. For example, if the lead protein is an enzyme and the 
target motif includes the active site of the enzyme, databases of proteins/peptides of a 
particular origin, organism, species or combinations thereof, may be queried using 

10 various search criteria to yield a hit list of sequences each of which can substitute the 
target motif in the lead protein. A similar approach may be used for designing other 
motifs or domains of the lead protein. The designed sequences for each individual 
motif/domain may be combined to generate a library of designed proteins. In addition, to 
reduce immunogenicity of the designed proteins for human applications such as 

15 therapeutics or diagnosis, databases of proteins of human origin or humanized proteins 
are preferably searched to yield the hit list of sequences, especially for motifs derived 
from sites of the lead protein that serves as the scaffolding of the lead proteins sue as the 
frameworks of an antibody. The library of designed proteins can be tested 
experimentally to yield proteins with improved biological function(s) over the lead 

20 protein. 

In a particular aspect of the invention, the inventive methodology is implemented 
in designing antibodies that are diverse in sequence and yet functionally related to each 
other. Based on the designed antibody sequences, a library of antibodies can be 
constructed to include diverse sequences in the complementary determining regions 

25 (CDRs) and/or humanized frameworks (FRs) of a non-human antibody in a high 

throughput manner. This library of antibodies can be screened against a wide variety of 
target molecules for novel or improved functions. 

In yet another aspect of the invention, a method is provided for in silico selection 
of antibody sequences based on the amino acid sequence of a region in a lead antibody, 

30 herein after referred to as the "lead sequence". The lead sequence is employed to search 
databases of protein sequences. The choice of the database depends on the specific 
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functional requirement of the designed motifs. For example: in order to design the 
framework regions of variable chains for therapeutic application, collections of protein 
sequences that are evolutionarily related such as fully human immunoglobulin sequences 
and human germline immunoglobulin sequences should be used except for a few 
5 structurally critical sites. This would reduce the immunogenic response by preserving the 
origin of the sequences by introducing as few foreign mutants as possible in this highly 
conserved region (for framework regions). On the other hand, diverse sequence 
databases such as immunoglobulin sequences of various species or even unrelated 
sequence in genbank can be used to design the CDRs in order to improve binding affinity 

10 with antigens in this highly variable region. By using the method, a library of diverse 

antibody sequences can be constructed and screened experimentally in vitro or in vivo for 
antibody mutants with improved or desired function(s). 

In one embodiment, the method comprises the steps of: providing an amino acid 
sequence of the variable region of the heavy chain (V H ) or light chain (Vl) of a lead 

1 5 antibody; identifying the amino acid sequences in the CDRs of the lead antibody; 
selecting one of the CDRs in the V H or V L region of the lead antibody; providing an 
amino acid sequence that comprises at least 3 consecutive amino acid residues in the 
selected CDR, the selected amino acid sequence being a lead sequence; comparing the 
lead sequence with a plurality of tester protein sequences; and selecting from the plurality 

20 of tester protein sequences at least two peptide segments that have at least 15% sequence 
identity with the lead sequence, the selected peptide segments forming a hit library. 

The method may further comprise the step of: constructing a nucleic acid library 
comprising DNA segments encoding the amino acid sequences of the hit library. 

Optionally, the method may further comprise the steps of: building an amino acid 

25 positional variant profile of the hit library; converting amino acid positional variant 

profile of the hit library into a nucleic acid positional variant profile by back-translating 
the amino acid positional variants into their corresponding genetic codons; and 
constructing a degenerate nucleic acid library of DNA segments by combinatorially 
combining the nucleic acid positional variants. 

30 Optionally, the genetic codons may be the ones that are preferred for expression 

in bacteria. Optionally, genetic codons may be the ones that can reduce the size chosen 
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such that the diversity of the degenerate nucleic acid library of DNA segments is within 
the experimentally coverable diversity without undue experimental efforts, for example, 
to be below lxlO 7 , and preferably below lxlO 6 . 

In another embodiment, the method comprises the steps of: providing an amino 
5 acid sequence of the variable region of the heavy chain (V H ) or light chain (V L ) of a lead 
antibody; identifying the amino acid sequences in the CDRs and FRs of the lead 
antibody; selecting one of the CDRs in the V H or V L region of the lead antibody; 
providing a first amino acid sequence that comprises at least 3 consecutive amino acid 
residues in the selected CDR, the selected amino acid sequence being a CDR lead 
10 sequence; comparing the CDR lead sequence with a plurality of CDR tester protein 
sequences; selecting from the plurality of CDR tester protein sequences at least two 
peptide segments that have at least 15% sequence identity with the CDR lead sequence, 
the selected peptide segments forming a CDR hit library; selecting one of the FRs in the 
V H or V L region of the lead antibody; providing a second amino acid sequence that 
15 comprises at least 3 consecutive amino acid residues in the selected FR, the selected 

amino acid sequence being a FR lead sequence; comparing the FR lead sequence with a 
plurality of FR tester protein sequences; and selecting from the plurality of FR tester 
protein sequences at least two peptide segments that have at least 15% sequence identity 
with the FR lead sequence, the selected peptide segments forming a FR hit library; and 
20 combining the CDR hit library and the FR hit library to form a hit library. 

According to the method, the plurality of CDR tester protein sequences may 
comprise amino acid sequences of human or non-human antibodies. 

Also according to the method, the plurality of FR tester protein sequences may 
comprise amino acid sequences of human origins, preferably human or humanized 
25 antibodies (e.g., antibodies with at least 50% human sequence, preferably at least 70% 
human sequence, more preferably at least 90 % human sequence, and most preferably at 
least 95% human sequence in V H or V L ), more preferably fully human antibodies, and 
most preferably human germline antibodies. 

Also according to the method, at least one of the plurality of CDR tester protein 
30 sequences is different from the plurality of FR tester protein sequences. 
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Also according to the method, the plurality of CDR tester protein sequences are 
human or non-human antibody sequences and the plurality of FR tester protein sequences 
are human antibody sequences, preferably human germline antibody sequences. 

The method may further comprise the step of: constructing a nucleic acid library 
5 comprising DNA segments encoding the amino acid sequences of the hit library. 

Optionally, the method may further comprise the steps of: building an amino acid 
positional variant profile of the CDR hit library; converting the amino acid positional 
variant profile of the CDR hit library into a first nucleic acid positional variant profile by 
back-translating the amino acid positional variants into their corresponding genetic 
10 codons; and constructing a degenerate CDR nucleic acid library of DNA segments by 
combinatorially combining the nucleic acid positional variants. 

Optionally, the genetic codons may be the ones that are preferred for expression 
in bacteria. Optionally, genetic codons may be the ones that can reduce the size chosen 
such that the diversity of the degenerate nucleic acid library of DNA segments is within 
15 the experimentally coverable diversity without undue experimental efforts, such as 
diversity below 1 x 1 0 7 , preferably below 1 x 1 0 6 . 

In yet another embodiment, the method comprises the steps of: providing an 
amino acid sequence of the variable region of the heavy chain (V H ) or light chain (V L ) of 
a lead antibody; identifying the amino acid sequences in the FRs of the lead antibody; 
20 selecting one of the FRs in the V H or V L region of the lead antibody; providing a first 
amino acid sequence that comprises at least 3 consecutive amino acid residues in the 
selected FR, the selected amino acid sequence being a first FR lead sequence; comparing 
the first lead FR sequence with a plurality of FR tester protein sequences; and selecting 
from the plurality of FR tester protein sequences at least two peptide segments that have 
25 at least 15% sequence identity with the first FR lead sequence, the selected peptide 
segments forming a first FR hit library. 

The method may further comprise the steps of: providing a second amino acid 
sequence that comprises at least 3 consecutive amino acid residues in a FR that is 
different from the selected FR, the selected amino acid sequence being a second FR lead 
30 sequence; comparing the second FR lead sequence with the plurality of FR tester protein 
sequences; and selecting from the plurality of FR tester protein sequences at least two 
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peptide segments that have at least 15% sequence identity with the second FR lead 
sequence, the selected peptide segments forming a second FR hit library; and combining 
the first FR hit library and the second FR hit library to form a hit library. 

According to the method, the lead CDR sequence may comprise at least 5 
5 consecutive amino acid residues in the selected CDR. The selected CDR may be selected 
from the group consisting of V H CDR1, V H CDR2, V H CDR3, V L CDR1, V L CDR2, and 
V L CDR3 of the lead antibody. 

Also according to the method, the lead FR sequence may comprise at least 5 
consecutive amino acid residues in the selected FR. The selected FR may be selected 

10 from the group consisting of V H FR1, V H FR2, V H FR3, V H FR4, V L FR1, V L FR2, V L 
FR3 and V L FR4 of the lead antibody. 

The method may further comprise the step of: constructing a nucleic acid or 
degenerate nucleic acid library comprising DNA segments encoding the amino acid 
sequences of the hit library. 

15 In another aspect of the invention, a method is provided for in silico selection of 

antibody sequences based on the amino acid sequence of a region in a lead antibody, i.e., 
the "lead sequence", and its 3D structure. The structure of the lead sequence is employed 
to search databases of protein structures for segments having similar 3D structures. 
These segments are aligned to yield a sequence profile, herein after referred to as the 

20 "lead sequence profile". The lead sequence profile is employed to search databases of 
protein sequences for remote homologues of the lead sequence having low sequence 
identity and yet structurally similar. By using the method, a library of diverse antibody 
sequences can be constructed and screened experimentally in vitro or in vivo for antibody 
mutants with improved or desired function(s). 

25 In one embodiment, the method comprises the steps of: providing an amino acid 

sequence of the variable region of the heavy chain (V H ) or light chain (V L ) of a lead 
antibody; identifying the amino acid sequences in the CDRs of the lead antibody; 
selecting one of the CDRs in the V H or V L region of the lead antibody; providing an 
amino acid sequence that comprises at least 3 consecutive amino acid residues in the 

30 selected CDR, the selected amino acid sequence being a lead sequence; providing a three- 
dimensional structure of the lead sequence; building a lead sequence profile based on the 
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structure of the lead sequence; comparing the lead sequence profile with a plurality of 
tester protein sequences; and selecting from the plurality of tester protein sequences at 
least two peptide segments that have at least 10% sequence identity with lead sequence, 
the selected peptide segments forming a hit library. 
5 According to the method, the three-dimensional structure of the lead sequence 

may be a structure derived from X-crystallography, nuclear magnetic resonance (NMR) 
spectroscopy or theoretical structural modeling. 

According to the method, the step of building a lead sequence profile may 
include: comparing the structure of the lead sequence with the structures of a plurality of 

10 tester protein segments; determining the root mean square difference of the main chain 
conformations of the lead sequence and the tester protein segments; selecting the tester 
protein segments with root mean square difference of the main chain conformations less 
than 5 A, preferably less than 4 A, more preferably less than 3 A, and most preferably 
less than 2 A; and aligning the amino acid sequences of the selected tester protein 

1 5 segments with the lead sequence to build the lead sequence profile. 

Optionally, the structures of the plurality of tester protein segments are retrieved 
from the protein data bank. 

Optionally, the step of building a lead sequence profile may include: comparing 
the structure of the lead sequence with the structures of a plurality of tester protein 

20 segments; determining the Z-score of the main chain conformations of the lead sequence 
and the tester protein segments; selecting the segments of the tester protein segments with 
the Z-score higher than 2, preferably higher than 3, more preferably higher than 4, and 
most preferably higher than 5; and aligning the amino acid sequences of the selected 
tester protein segments with the lead sequence to build the lead sequence profile. 

25 Optionally, the step of building a lead sequence profile may be implemented by 

an algorithm selected from the group consisting of CE, MAPS, Monte Carlo and 3D 
clustering algorithms. 

The method may further comprise the step of: constructing a nucleic acid library 
comprising DNA segments encoding the amino acid sequences of the hit library. 

30 Optionally, the method may further comprise the steps of: building an amino acid 

positional variant profile of the hit library; converting amino acid positional variant 
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profile of the hit library into a nucleic acid positional variant profile by back-translating 
the amino acid positional variants into their corresponding trinucleotide codons; and 
constructing a degenerate nucleic acid library of DNA segments by combinatorial^ 
combining the nucleic acid positional variants. 
5 In yet another aspect of the invention, a method is provided for in silico selection 

of antibody sequences based on a 3D structure of a lead antibody. A lead sequence or 
sequence profile from a specific region of the lead antibody to be employed to search 
databases of protein sequences for remote homologues of the lead sequence having low 
sequence identity and yet structurally similar. These remote homologues form a hit 

10 library. The sequences in the hit library are subjected to evaluation for their structural 
compatibility with a 3D structure of the lead antibody, hereinafter referred to as the "lead 
structural template". Sequences in the hit library that are structurally compatible with the 
lead structural template are selected and screened experimentally in vitro or in vivo for 
antibody mutants with improved or desired function(s). 

15 In one embodiment, the method comprises the steps of: providing an amino acid 

sequence of the variable region of the heavy chain (V H ) or light chain (V L ) of a lead 
antibody, the lead antibody having a known three dimensional structure which is defined 
as a lead structural template; identifying the amino acid sequences in the CDRs of the 
lead antibody; selecting one of the CDRs in the V H or V L region of the lead antibody; 

20 providing an amino acid sequence that comprises at least 3 consecutive amino acid 

residues in the selected CDR, the selected amino acid sequence being a lead sequence; 
comparing the lead sequence profile with a plurality of tester protein sequences; selecting 
from the plurality of tester protein sequences at least two peptide segments that have at 
least 10% sequence identity with lead sequence, the selected peptide segments forming a 

25 hit library; determining if a member of the hit library is structurally compatible with the 
lead structural template using a scoring function; and selecting the members of the hit 
library that score equal to or better thanor equal to the lead sequence. 

According to the method, the scoring function is an energy scoring function 
selected from the group consisting of electrostatic interactions, van der Waals 

30 interactions, electrostatic solvation energy, solvent-accessible surface solvation energy, 
and conformational entropy. 
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Optionally, the scoring function is one incorporating a forcefield selected from the 
group consisting of the Amber forcefield, Charmm forcefield, the Discover cvff 
forcefields, the ECEPP forcefields, the GROMOS forcefields, the OPLS forcefields, the 
MMFF94 forcefield, the Tripos forcefield, the MM3 forcefield, the Dreiding 
5 forcefield,and UNRES forcefield,and other knowledge-based statistical forcefield (mean 
field) and structure-based thermodynamic potential functions. 

Also according to the method, the step of selecting the members of the hit library 
includes selecting the members of the hit library that have a lower or equal total energy 
than that of the lead sequence calculated based on a formula of 

1 0 AEtotal ~ E v dw Ebond E a nge1 ^ Electrostatics E so lvation 

Also according to the method, the step of selecting the members of the hit library 
includes selecting the members of the hit library that have a lower binding free energy 
than that of the lead sequence calculated as the difference between the bound and 
1 5 unbound states using a refined scoring function 

AGb = AGmm + AGsoi -TAS SS 

where 

20 AG M M = AG e ie +AG vdw (l) 

AGsoi =AG e i e -soi +AG A sa(2) 

The method may further comprise the step of: constructing a nucleic acid library 
comprising DNA segments encoding the amino acid sequences of the hit library. 

25 Optionally, the method may further comprise the steps of: building an amino acid 

positional variant profile of the hit library; converting amino acid positional variant 
profile of the hit library into a nucleic acid positional variant profile by back-translating 
the amino acid positional variants into their corresponding trinucleotide codons; and 
constructing a degenerate nucleic acid library of DNA segments by combinatorially 

30 combining the nucleic acid positional variants. 
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In yet another aspect of the invention, a method is provided for in silico selection 
of antibody sequences based on a 3D structure or structure ensemble of a lead antibody, 
or a structure ensemble of multiple antibodies, hereinafter collectively referred to as the 
lead structural template. A lead sequence or sequence profile from a specific region of 
5 the lead antibody to be employed to search databases of protein sequences for remote 
homologues of the lead sequence having low sequence identity and yet structurally 
similar. These remote homologues form a hit library. An amino acid positional variant 
profile (AA-PVP) of the hit library is built based on frequency of amino acid variant 
appearing at each position of the lead sequence. Based on the AA-PVP, a hit variant 

10 library is constructed by combinatorially combining the amino acid variant at each 
position of the lead sequence with or without cutoff of low frequency variants. The 
sequences in the hit variant library are subjected to evaluation for their structural 
compatibility with the lead structural template. Sequences in the hit library that are 
structurally compatible with the lead structural template are selected and screened 

15 experimentally in vitro or in vivo for antibody mutants with improved or desired 
function(s). 

In one embodiment, the method comprises the steps of: providing an amino acid 
sequence of the variable region of the heavy chain (V H ) or light chain (V L ) of a lead 
antibody, the lead antibody having a known three dimensional structure which is defined 

20 as a lead structural template; identifying the amino acid sequences in the CDRs of the 
lead antibody; selecting one of the CDRs in the V H or V L region of the lead antibody; 
providing an amino acid sequence that comprises at least 3 consecutive amino acid 
residues in the selected CDR, the selected amino acid sequence being a lead sequence; 
comparing the lead sequence with a plurality of tester protein sequences; selecting from 

25 the plurality of tester protein sequences at least two peptide segments that have at least 
10% sequence identity with lead sequence, the selected peptide segments forming a hit 
library; building an amino acid positional variant profile of the hit library based on 
frequency of amino acid variant appearing at each position of the lead sequence; 
combining the amino acid variants in the hit library to produce a combination of hit 

30 variants which form a hit variant library; determining if a member of the hit variant 
library is structurally compatible with the lead structural template using a scoring 
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function; and selecting the members of the hit variant library that score equal to or better 
than the lead sequence. 

According to the method, the step of combining the amino acid variants in the hit 
library includes: selecting the amino acid variants with frequency of appearance higher 
5 than 2%, preferably 5%, more preferably 8% times, and most preferably 10% of the 

amino acid occurrence frequency for the cutoff and then include some of the amino acids 
from the lead sequence if they are missed after cutoff; and combining the selected amino 
acid variants in the hit library to produce a combination of hit variants which form a hit 
variant library. 

10 According to the method, the scoring function is an energy scoring function 

selected from the group consisting of electrostatic interactions, van der Waals 
interactions, electrostatic solvation energy, solvent-accessible surface solvation energy, 
and conformational entropy. 

Optionally, the scoring function is one incorporating a forcefield selected from the 

15 group consisting of the Amber forcefield, Charmm forcefield, the Discover cvff 

forcefields, the ECEPP forcefields, the GROMOS forcefields, the OPLS forcefields, the 
MMFF94 forcefield, the Tripos forcefield, the MM3 forcefield, the Dreiding 
forcefield,and UNRES forcefield,and other knowledge-based statistical forcefield (mean 
field) and structure-based thermodynamic potential functions. 

20 The method may further comprise the step of: constructing a nucleic acid library 

comprising DNA segments encoding the amino acid sequences of the selected members 
of the hit variant library. 

Optionally, the method may further comprise the steps of: partitioning theparsing 
the selected members of hit variant library into at least two sub-hit variant libraries; 

25 selecting a sub-hit variant library; building an amino acid positional variant profile of the 
selected sub-hit variant library; converting the amino acid positional variant profile of the 
selected sub-hit variant library into a nucleic acid positional variant profile by back- 
translating the amino acid positional variants into their corresponding trinucleotide 
codons; and constructing a degenerate nucleic acid library of DNA segments by 

30 combinatorially combining the nucleic acid positional variants. 
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The step of parsing the hit variant library may include: randomly selecting 10-30 
members of the hit variant library that score equal to or better than the lead sequence, the 
selected members forming a sub-variant library. 

Optionally, the step of parsing the hit variant library may include: building an 
5 amino acid positional variant profile of the hit variant library, resulting a hit variant 
profile; parsing the hit variant profile into segments of sub-variant profile based on the 
contact maps of the Cot, or CP or heavy atoms of the structure or structure ensembles of a 
lead sequence within certain distance cutoff (8 A to 4.5 A). A structural model or lead 
structural template within a distance of 8 A, preferably within 6 A, more preferably 

10 within 5 A, and most preferably within 4.5 A. 

In another embodiment, the method comprises the steps of: providing an amino 
acid sequence of the variable region of the heavy chain (V H ) or light chain (V L ) of a lead 
antibody, the lead antibody having a known three dimensional structure; providing 3D 
structures of one or more antibodies with different sequences in V H or V L region than that 

15 of the lead antibody; forming a structure ensemble by combining the structures of the 
lead antibody and the one or more antibodies; the structure ensemble being defined as a 
lead structural template; identifying the amino acid sequences in the CDRs of the lead 
antibody; selecting one of the CDRs in the V H or V L region of the lead antibody; 
providing an amino acid sequence that comprises at least 3 consecutive amino acid 

20 residues in the selected CDR, the selected amino acid sequence being a lead sequence; 
comparing the lead sequence with a plurality of tester protein sequences; selecting from 
the plurality of tester protein sequences at least two peptide segments that have at least 
10% sequence identity with lead sequence, the selected peptide segments forming a hit 
library; building an amino acid positional variant profile of the hit library based on 

25 frequency of amino acid variant appearing at each position of the lead sequence; 

combining the amino acid variants in the hit library to produce a combination of hit 
variants which form a hit variant library; determining if a member of the hit variant 
library is structurally compatible with the lead structural template using a scoring 
function; and selecting the members of the hit variant library that score equal to or better 

30 than the lead sequence. 
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In a particular embodiment, the method comprises the steps of: a) providing an 
amino acid sequence of the variable region of the heavy chain (V H ) or light chain (V L ) of 
a lead antibody, the lead antibody having a known three dimensional structure; b) 
identifying the amino acid sequences in the CDRs of the lead antibody; c) selecting one 
5 of the CDRs in the V H or V L region of the lead antibody; d) providing an amino acid 
sequence that comprises at least 3 consecutive amino acid residues in the selected CDR, 
the selected amino acid sequence being defined as a lead sequence; e) comparing the lead 
sequence with a plurality of tester protein sequences; f) selecting from the plurality of 
tester protein sequences at least two peptide segments that have at least 10% sequence 

10 identity with lead sequence, the selected peptide segments forming a hit library; g) 

building an amino acid positional variant profile of the hit library based on frequency of 
amino acid variant appearing at each position of the lead sequence; h) combining the 
amino acid variants in the hit library to produce a combination of hit variants which form 
a hit variant library; I) determining if a member of the hit variant library is structurally 

15 compatible with the lead structural template using a scoring function; j) selecting the 
members of the hit variant library that score equal to or better than the lead sequence; k) 
constructing a degenerate nucleic acid library comprising DNA segments encoding the 
amino acid sequences of the selected members of the hit variant library; 1) determining 
the diversity of the nucleic acid library, if the diversity is higher than lxl 0 6 , repeating 

20 steps j) through 1) until the diversity of the diversity of the nucleic acid library is equal to 
or lower than lxl 0 6 ; m) introducing the DNA segments in the degenerate nucleic acid 
library into cells of a host organism; n) expressing the DNA segments in the host cells 
such that recombinant antibodies containing the amino acid sequences of the hit library 
are produced in the cells of the host organism; o) selecting the recombinant antibody that 

25 binds to a target antigen with affinity higher than 10 6 M" 1 ; and p) repeating steps e) 

through o) if no recombinant antibody is found to bind to the target antigen with affinity 
higher than 10 6 M _1 . 

In another particular embodiment, the method comprises the steps of: 
a) providing an amino acid sequence of the variable region of the heavy chain (Vh) or 

30 light chain (Vl) of a lead antibody, the lead antibody having a known three dimensional 
structure which is defined as a lead structural template; b) identifying the amino acid 



2369565 6.DOC 



-41- 



PATENT 

Attorney Docket No: 26050-709.501 

sequences in the CDRs of the lead antibody; c) selecting one of the CDRs in the V H or V L 
region of the lead antibody; d) providing an amino acid sequence that comprises at least 3 
consecutive amino acid residues in the selected CDR, the selected amino acid sequence 
being defined as a lead sequence; e) mutating the lead sequence by substituting one or 
5 more of the amino acid residues of the lead sequence with one or more different amino 
acid residues, resulting in a lead sequence mutant library; f) determining if a member of 
the lead sequence mutant library is structurally compatible with the lead structural 
template using a first scoring function; g) selecting the lead sequence mutants that score 
equal to or better than the lead sequence; h) comparing the lead sequence with a plurality 

10 of tester protein sequences; I) selecting from the plurality of tester protein sequences at 
least two peptide segments that have at least 10% sequence identity with lead sequence, 
the selected peptide segments forming a hit library; j) building an amino acid positional 
variant profile of the hit library based on frequency of amino acid variant appearing at 
each position of the lead sequence; k) combining the amino acid variants in the hit library 

15 to produce a combination of hit variants; 1) combining the selected lead sequence mutants 
with the combination of hit variants to produce a hit variant library; m) determining if a 
member of the hit variant library is structurally compatible with the lead structural 
template using a second scoring function; n) selecting the members of the hit variant 
library that score equal to or better than the lead sequence; o) constructing a degenerate 

20 nucleic acid library comprising DNA segments encoding the amino acid sequences of the 
selected members of the hit variant library; p) determining the diversity of the nucleic 
acid library, and if the diversity is higher than lxlO 6 , repeating steps n) through p) until 
the diversity of the diversity of the nucleic acid library is equal to or lower than lxl 0 6 ; q) 
introducing the DNA segments in the degenerate nucleic acid library into cells of a host 

25 organism; r) expressing the DNA segments in the host cells such that recombinant 

antibodies containing the amino acid sequences of the hit library are produced in the cells 
of the host organism; s) selecting the recombinant antibody that binds to a target antigen 
with affinity higher than 10 6 M" 1 ; and t) repeating steps e) through s) if no recombinant 
antibody is found to bind to the target antigen with affinity higher than 10 6 M" 1 . 

30 In yet another aspect of the present invention, a computer-implemented method is 

provided for constructing a library of mutant antibodies based on a lead antibody. 
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In one embodiment, the method comprises: taking as an input an amino acid 
sequence that comprises at least 3 consecutive amino acid residues in a CDR region of 
the lead antibody, the amino acid sequence being a lead sequence; employing a computer 
executable logic to compare the lead sequence with a plurality of tester protein 
5 sequences; selecting from the plurality of tester protein sequences at least two peptide 
segments that have at least 15% sequence identity with lead sequence; and generating as 
an output the selected peptide segments which form a hit library. 

According to any of the above methods, the length of the. lead sequence is 
preferably between 5-100 aa, more preferably between 6-80 aa, and most preferably 
10 between 8-50 aa. 

According to any of the above methods, the step of identifying the amino 
sequences in the CDRs is carried out by using Kabat criteria or Chothia criteria. 

Also according to any of the above methods, the lead sequence may comprise an 
amino acid sequence from a particular region within the V H or V L of the lead antibody, 
15 CDR1, CDR2 or CDR3, or from a combination of the CDR and FRs, such as CDR1-FR2, 
FR2-CDR2-FR3, and the full-length V H or V L sequence. The lead sequence preferably 
comprises at least 6 consecutive amino acid residues in the selected CDR, more 
preferably at least 7 consecutive amino acid residues in the selected CDR, and most 
preferably all of the amino acid residues in the selected CDR. 
20 Also according to any of the above methods, the lead sequence may further 

comprise at least one of the amino acid residues immediately adjacent to the selected 
CDR. 

Also according to any of the above methods, the lead sequence may further 
comprise at least one of the FRs flanking the selected CDR. 
25 Also according to any of the above methods, the lead sequence may further 

comprise one or more CDRs or FRs adjacent the C-terminus or N-terminus of the 
selected CDR. 

Also according to any of the above methods, the lead structural template may be a 
3D structure of a fully assembled lead antibody, or a heavy chain or light chain variable 
30 region of the lead antibody (e.g., CDR, FR and a combination thereof). 
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Also according to any of the above methods, the plurality of tester protein 
sequences includes preferably antibody sequences, more preferably human antibody 
sequences, and most preferably human germline antibody sequences (V-database), 
especially for the framework regions. 
5 Also according to any of the above methods, the plurality of tester protein 

sequences is retrieved from genbank of the NIH or Swiss-Prot database or the Kabat 
database for CDRs of antibodies. 

Also according to any of the above methods, the step of comparing the lead 
sequence with the plurality of tester protein sequences is implemented by an algorithm 
10 selected from the group consisting of BLAST, PSI-BLAST, profile HMM, and 
COBLATH. 

Also according to any of the above methods, the sequence identity of the selected 
peptide segments in the hit library with the lead sequence is preferably at least 25%, 
preferably at least 35%, and most preferably at least 45%. 
1 5 According to any of the above method, the method further comprises the 

following steps: introducing the DNA segments in the nucleic acid or degenerate nucleic 
acid library into cells of a host organism; expressing the DNA segments in the host cells 
such that recombinant antibodies containing the amino acid sequences of the hit library 
encoded by the nucleic acid or degenerate nucleic acid library are produced in the cells of 
20 the host organism; and selecting the recombinant antibody that binds to a target antigen 
with affinity higher than 1 0 6 M" 1 , optionally 1 0 7 M~ l , optionally 1 0 8 M" 1 , optionally 1 0 9 
M" 1 , optionally 2xl0 9 M' 1 , optionally 5xl0 9 M" 1 , optionally lxlO 10 M"\ optionally 5xl0 10 
M" 1 , and optionally lxlO 11 M* 1 . 

The recombinant antibodies may be fully assembled antibodies, Fab fragments, 
25 Fv fragments, or single chain antibodies. 

The host organism includes any organism or its cell line that is capable of 
expressing transferred foreign genetic sequence, including but not limited to bacteria, 
yeast, plant, insect, and mammals. 

The recombinant antibodies may be fully assembled antibodies, Fab fragments, 
30 Fv fragments, or single chain antibodies. For example, the recombinant antibodies may 
be expressed in bacterial cells and displayed on the surface of phage particles. The 
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recombinant antibodies displayed on phage particles may be a double-chain heterodimer 
formed between V H and V L . The heterodimerization of V H and V L chains may be 
facilitated by a heterodimer formed between two non-antibody polypeptide chains fused 
to the Vh and V L chains, respectively. For example, these two non-antibody polypeptide 
5 may be derived from a heterodimeric receptors GABA B Rl (GR1) and R2 (GR2), 
respectively. 

Alternatively, the recombinant antibodies displayed on phage particles may be a 
single-chain antibody containing V H and V L linked by a peptide linker. The display of 
the single chain antibody on the surface of phage particles may be facilitated by a 

10 heterodimer formed between a fusion of the single chain antibody with GR1 and a fusion 
of phage pill capsid protein with GR2. 

The target antigen to be screened against includes small molecules and 
macromolecules such as proteins, peptides, nucleic acids and polycarbohydrates. 

Any of the above methods may further comprise the following steps: introducing 

15 the DNA segments in the nucleic acid or degenerate nucleic acid library into cells of a 
host organism; expressing the DNA segments in the host cells such that recombinant 
antibodies containing the amino acid sequences of the hit library are produced in the cells 
of the host organism; and selecting the recombinant antibody that binds to a target 
antigen with affinity higher than 10 6 M' 1 . 

20 The binding affinity of the selected recombinant antibody to the target antigen is 

optionally higher than 10 7 M' 1 , optionally higher than 10 8 NT 1 , optionally higher than 1 x 
10 9 M" 1 , optionally higher than 2 x 10 9 M' 1 , optionally higher than 5 x 10 9 M" 1 , optionally 
higher than 8 x 10 9 M* 1 , optionally higher than 1 x 10 10 M" 1 , optionally higher than 2 x 
10 10 M" 1 , optionally higher than 5 x 10 10 M"\ optionally higher than 8 x 10 10 M" 1 , or 

25 optionally higher than 1 x 10 11 M" 1 . 

The binding affinity of the selected antibody to its antigen may vary, depending 
the form of antibody being tested. The selected antibody being tested may be in the form 
of a single-chain antibody (scFv) comprising V H and V L designed by using the 
methodology of the present invention. Optionally, the selected antibody being tested may 

30 be in the form of a Fab comprising V H and V L designed by using the methodology of the 
present invention. Presumably due to its higher conformational flexibility and instability, 
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the binding affinity of the selected antibody in the form of scFv may be 1-2 magnitude 
lower than that in the form of Fab. Accordingly, for a selected antibody in the form of 
scFv, the dissociation constant is preferably lower than 10 6 M' 1 , optionally higher than 
10 7 M" 1 , optionally higher than 1 x 10 8 M*\ optionally higher than 2 x 10 8 M" 1 , optionally 
5 higher than 5 x 10 8 M" 1 , optionally higher- than 8 x 10 8 M" 1 , optionally higher than 1 x 10 9 
M' 1 , optionally higher than 2 x 10 9 M" 1 , optionally higher than 5 x 10 9 M" 1 , optionally 
higher than 8 x 10 9 M" 1 , optionally higher than 1 x 10 10 M" 1 , optionally higher than 5 x 
lO^M" 1 , or optionally higher than lx 10 11 M'\ 

The binding affinity of the selected recombinant antibody to the target antigen 

10 may also be represented by the dissociation constant K<i measured in a kinetic study of 
the binding interaction between the antibody and the target antigen at a certain 
temperature (e.g., 4°C, 25°C, 35°C, 37°C, or 42°C), for example, by using an instrument 
such as a BIAcore biosensor (see EXAMPLE). Generally, the lower measured, the 
higher affinity the antibody; and the higher temperature of measurement, the higher K<j 

15 measured for the same antibody. 

The K<i of a selected antibody is optionally lower than 100 nM, optionally lower 
than 10 nM, optionally lower than 8 nM, optionally lower than 8 nM, optionally lower 
than 5 nM, optionally lower than 1 nM, optionally lower than 0.8 nM, optionally lower 
than 0.5 nM, optionally lower than 0.2 nM, optionally lower than 0.1 nM, optionally 

20 lower than 0.08 nM, optionally lower than 0.05 nM, optionally lower than 0.01 nM, or 
optionally lower than 0.005 nM, in the form of scFv, Fab, or other form of antibody 
measured at a temperature of about 4°C, 25°C, 35°C, 37°C or 42°C. 

According to any of the above embodiments, the designed proteins (e.g.. 
antibodies) may be synthesized, or expressed in cells of any organism, including but not 

25 limited to bacteria, yeast, plant, insect, and mammal. Particular types of cells include, but 
are not limited to, Drosophila melanogaster cells, Saccharomyces cerevisiae and other 
yeasts, E. coli, Bacillus subtilis, SF9 cells, CI 29 cells, 293 cells, Neurospora, BHK, 
CHO, COS, and HeLa cells, fibroblasts, Schwanoma cell lines, immortalized mammalian 
myeloid and lymphoid cell lines, Jurkat cells, mast cells and other endocrine and exocrine 

30 cells, and neuronal cells. Examples of mammalian cells include, but are not limited to, 
tumor cells of all types (particularly melanoma, myeloid leukemia, carcinomas of the 



2369565 6.DOC 



-46- 



PATENT 

Attorney Docket No: 26050-709.501 

lung, breast, ovaries, colon, kidney, prostate, pancreas and testes), cardiomyocytes, 
endothelial cells, epithelial cells, lymphocytes (T-cell and B cell), mast cells, eosinophils, 
vascular intimal cells, hepatocytes, leukocytes including mononuclear leukocytes, stem 
cells such as haemopoetic, neural, skin, lung, kidney, liver and myocyte stem cells, 
5 osteoclasts, chondrocytes and other connective tissue cells, keratinocytes, melanocytes, 
liver cells, kidney cells, and adipocytes. 

Preferably, the designed protein is purified or isolated after expression according 
to methods known to those skilled in the art. Examples of purification methods include 
electrophoretic, molecular, immunological and chromatographic techniques, including 

10 ion exchange, hydrophobic, affinity, and reverse-phase HPLC chromatography, and 

chromatofocusing. The degree of purification necessary will vary depending on the use of 
the designed protein. In some instances no purification will be necessary. 

Also according to any of the embodiments described above, the designed proteins 
can be screened for a desired function, preferably a biological function such as their 

15 binding to a known binding partner, physiological activity, stability profile (pH, thermal, 
buffer conditions), substrate specificity, immunogenicity, toxicity, etc. 

In the screening using a cell-based assay, the designed protein may be selected 
based on an altered phenotype of the cell, preferably in some detectable and/or 
measurable way. Examples of phenotypic changes include, but are not limited to, gross 

20 physical changes such as changes in cell morphology, cell growth, cell viability, adhesion 
to substrates or other cells, and cellular density; changes in the expression of one or more 
RNAs, proteins, lipids, hormones, cytokines, or other molecules; changes in the 
equilibrium state (i.e. half-life) or one or more RNAs, proteins, lipids, hormones, 
cytokines, or other molecules; changes in the localization of one or more RNAs, proteins, 

25 lipids, hormones, cytokines, or other molecules; changes in the bioactivity or specific 
activity of one or more RNAs, proteins, lipids, hormones, cytokines, receptors, or other 
molecules; changes in the secretion of ions, cytokines, hormones, growth factors, or other 
molecules; alterations in cellular membrane potentials, polarization, integrity or transport; 
changes in infectivity, susceptability, latency, adhesion, and uptake of viruses and 

30 bacterial pathogens. 
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According to any of the above embodiment, the designed proteins (e.g.. 
antibodies) may be synthesized, or expressed as fusion proteins with a tag protein or 
peptide. The tag protein or peptide may be used to identify, isolate, signal, stabilize, 
increase flexibility of, increase degradation of, increase secretion, translocation or 
5 intracellular retention or enhance expression of the designed proteins. 

The invention further provides: isolated nucleic acid encoding the antibody; a 
vector comprising that nucleic acid, optionally operably linked to control sequences 
recognized by a host cell transformed with the vector; a host cell comprising that vector; 
a process for producing the antibody comprising culturing the host cell so that the nucleic 
10 acid is expressed and, optionally, recovering the antibody from the host cell culture. 

3. Method of Using the Antibodies of the Present Invention 

The antibodies designed by using the inventive methods may be used for 
15 diagnosing or therapeutic treatment of various diseases, including but not limited to, 
cancer, autoimmune diseases such as multiple sclerosis, rheumatoid arthritis, systemic 
lupus erythematosus, Type I diabetes, and myasthenia gravis, graft-versus-host disease, 
cardiovascular diseases, viral infection such as HIV, hepatitis viruses, and herpes simplex 
virus, bacterial infection, allergy, Type II diabetes, hematological disorders such as 
20 anemia. 

The antibodies can also be used as conjugates that are linked with diagnostic or 
therapeutic moieties, or in combination with chemotherapeutic or biological agents. 

The antibodies of the invention may be used as affinity purification agents. In this 
process, the antibodies are immobilized on a solid phase such a Sephadex resin or filter 

25 paper, using methods well known in the art. The immobilized antibody is contacted with 
a sample containing the VEGF protein (or fragment thereof) to be purified, and thereafter 
the support is washed with a suitable solvent that will remove substantially all the 
material in the sample except the VEGF protein, which is bound to the immobilized 
antibody. Finally, the support is washed with another suitable solvent, such as glycine 

30 buffer, pH 5.0, that will release the VEGF protein from the antibody. 
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Anti-VEGF antibodies may also be useful in diagnostic assays for VEGF protein, 
e.g., detecting its expression in specific cells, tissues, or serum. Such diagnostic methods 
may be useful in cancer diagnosis. 

For diagnostic applications, the antibody may be labeled with a detectable moiety. 
5 For example, the antibody can be labeled with a radioisotope, such as 35 S, 14 C, 125 1, 3 H, 
and 131 1, using the techniques described in Current Protocols in Immunology, Volumes 1 
and 2, Coligen et al., Ed. Wiley-Interscience, New York, New York, Pubs. (1991) for 
example and radioactivity can be measured using scintillation counting. 

The antibody may also be conjugated with a fluorescent label such as rare earth 
10 chelates (e.g., europium chelates), fluorescein and its derivatives, rhodamine and its 

derivatives, dansyl, Lissamine, phycoerythrin and Texas Red. The fluorescent labels can 
be conjugated to the antibody using the techniques disclosed in Current Protocols in 
Immunology, supra, for example. Fluorescence can be quantified using a fluorimeter. 

The antibody can be labeled with various enzyme-substrate labels such as those 
15 disclosed in U.S. Pat. No. 4,275,149. The enzyme generally catalyzes a chemical 

alteration of the chromogenic substrate which can be measured using various techniques. 
For example, the enzyme may catalyze a color change in a substrate, which can be 
measured spectrophotometrically. Alternatively, the enzyme may alter the fluorescence 
or chemiluminescence of the substrate. 
20 In one diagnostic application, the invention provides a method for determining the 

presence of VEGF protein comprising exposing a sample suspected of containing the 
VEGF protein to the anti-VEGF antibody and determining binding of the antibody to the 
sample. For this use, the invention provides a kit comprising the antibody and 
instructions for using the antibody to detect the VEGF protein. 
25 The antibodies of the present invention can also be formulated for delivery via a 

wide variety of routes of administration. Therapeutic formulations of the antibody are 
prepared for storage by mixing the antibody having the desired degree of purity with 
optional physiologically acceptable carriers, excipients or stabilizers (Remington's 
Pharmaceutical Sciences 16th edition, Osol, A. Ed. (1980)), in the form of lyophilized 
30 formulations or aqueous solutions. For example, the antibodies may be administered or 
coadministered orally, topically, parenterally, intraperitoneal^, intravenously, 
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intraarterially, transdermally, sublingually, intramuscularly, rectally, transbuccally, 
intranasally, via inhalation, vaginally, intraoccularly, via local delivery (for example by a 
catheter or a stent), subcutaneously, intraadiposally, intraarticularly, or intrathecally. 
Depending on the type and severity of the disease, about 1 ug/kg to about 50 
5 mg/kg (e.g., 0.1-20 mg/kg, 0.5-15 mg/Kg, and 1-10 mg/kg) of antibody is an initial 

candidate dosage for administration to the patient, whether, for example, by one or more 
separate administrations, or by continuous infusion. A typical daily or weekly dosage 
might range from about 1 ug/kg to about 20 mg/kg or more, depending on the factors 
mentioned above. For repeated administrations over several days or longer, depending 

10 on the condition, the treatment is repeated until a desired suppression of disease 

symptoms occurs. However, other dosage regimens may be useful. The progress of this 
therapy is easily monitored by conventional techniques and assays, including, for 
example, radiographic tumor imaging. 

The anti-VEGF antibodies of the present invention may be used to treat a wide 

15 variety of indications for anti-VEGF antibodies have therapeutic activity. Such 

indications include, but are not limited to, restenosis (e.g. coronary, carotid, and cerebral 
lesions), benign tumors, a various types of cancers such as primary tumors and tumor 
metastasis, abnormal stimulation of endothelial cells (atherosclerosis), insults to body 
tissue due to surgery, abnormal wound healing, abnormal angiogenesis, diseases that 

20 produce fibrosis of tissue, muscular degeneration, repetitive motion disorders, disorders 
of tissues that are not highly vascularized, and proliferative responses associated with 
organ transplants. 

Examples of benign tumors include hemangiomas, hepatocellular adenoma, 
cavernous haemangioma, focal nodular hyperplasia, acoustic neuromas, neurofibroma, 
25 bile duct adenoma, bile duct cystanoma, fibroma, lipomas, leiomyomas, mesotheliomas, 
teratomas, myxomas, nodular regenerative hyperplasia, trachomas and pyogenic 
granulomas. 

Specific types of cancers include, but are not limited to, leukemia, breast cancer, 
skin cancer, bone cancer, prostate cancer, liver cancer, lung cancer, brain cancer, cancer 
30 of the larynx, gallbladder, pancreas, rectum, parathyroid, thyroid, adrenal, neural tissue, 
head and neck, colon, stomach, bronchi, kidneys, basal cell carcinoma, squamous cell , 
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carcinoma of both ulcerating and papillary type, metastatic skin carcinoma, osteo 
sarcoma, Ewing ? s sarcoma, veticulum cell sarcoma, myeloma, giant cell tumor, small-cell 
lung tumor, gallstones, islet cell tumor, primary brain tumor, acute and chronic 
lymphocytic and granulocytic tumors, hairy-cell tumor, adenoma, hyperplasia, medullary 
5 carcinoma, pheochromocytoma, mucosal neuronms, intestinal ganglioneuromas, 

hyperplastic corneal nerve tumor, marfanoid habitus tumor, Wilm's tumor, seminoma, 
ovarian tumor, leiomyomater tumor, cervical dysplasia and in situ carcinoma, 
neuroblastoma, retinoblastoma, soft tissue sarcoma, malignant carcinoid, topical skin 
lesion, mycosis fungoide, rhabdomyosarcoma, Kaposi's sarcoma, osteogenic and other 

10 sarcoma, malignant hypercalcemia, renal cell tumor, polycythemia vera, 

adenocarcinoma, glioblastoma multiforma, leukemias, lymphomas, malignant 
melanomas, epidermoid carcinomas, and other carcinomas and sarcomas. 

Diseases associated with abnormal angiogenesis include, but are not limited to, 
rheumatoid arthritis, ischemic-reperfusion related brain edema and injury, cortical 

15 ischemia, ovarian hyperplasia and hypervascularity, (polycystic ovary syndrom), 

endometriosis, psoriasis, diabetic retinopaphy, and other ocular angiogenic diseases such 
as retinopathy of prematurity (retro lental fibroplastic), macular degeneration, corneal 
graft rejection, neuroscular glaucoma and Oster Webber syndrome. In a particular 
embodiment, the anti-VEGF antibodies of the present invention can be used for treating 

20 age-related macular degeneration (AMD). 

Examples of retinal/choroidal neovascularization include, but are not limited to, 
Bests diseases, myopia, optic pits, Stargarts diseases, Pagets disease, vein occlusion, 
artery occlusion, sickle cell anemia, sarcoid, syphilis, pseudoxanthoma elasticum carotid 
abostructive diseases, chronic uveitis/vitritis, mycobacterial infections, Lyme's disese, 

25 systemic lupus erythematosis, retinopathy of prematurity, Eales disease, diabetic 

retinopathy, macular degeneration, Bechets diseases, infections causing a retinitis or 
chroiditis, presumed ocular histoplasmosis, pars planitis, chronic retinal detachment, 
hyperviscosity syndromes, toxoplasmosis, trauma and post-laser complications, diseases 
associated with rubesis (neovascularization of the ankle) and diseases caused by the 

30 abnormal proliferation of fibrovascular or fibrous tissue including all forms of 
proliferative vitreoretinopathy. 
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Examples of corneal neovascularization include, but are not limited to, epidemic 
keratoconjunctivitis, Vitamin A deficiency, contact lens overwear, atopic keratitis, 
superior limbic keratitis, pterygium keratitis sicca, sjogrens, acne rosacea, 
phylectenulosis, diabetic retinopathy, retinopathy of prematurity, corneal graft rejection, 
5 Mooren ulcer, Terrien's marginal degeneration, marginal keratolysis, polyarteritis, 

Wegener sarcoidosis, Scleritis, periphigoid radial keratotomy, neovascular glaucoma and 
retrolental fibroplasia, syphilis, Mycobacteria infections, lipid degeneration, chemical 
burns, bacterial ulcers, fungal ulcers, Herpes simplex infections, Herpes zoster infections, 
protozoan infections and Kaposi sarcoma. 
10 The anti-VEGF antibodies of the present invention may be used in combination 

with an anti-angiogenesis agent for the treatment of diseases associated with abnormal 
angiogenesis. 

Examples of anti-angiogenesis agents include, but are not limited to, retinoid acid 
and derivatives thereof, 2-methoxyestradiol, ANGIOSTATIN™ protein, 

15 ENDOSTATIN™ protein, suramin, squalamine, tissue inhibitor of metalloproteinase-I, 
tissue inhibitor of metalloproteinase-2, plasminogen activator inhibitor- 1, plasminogen 
activator inhibitor-2, cartilage-derived inhibitor, paclitaxel, platelet factor 4, protamine 
sulphate (clupeine), sulphated chitin derivatives (prepared from queen crab shells), 
sulphated polysaccharide peptidoglycan complex (sp-pg), staurosporine, modulators of 

20 matrix metabolism, including for example, proline analogs ((l-azetidine-2-carboxylic acid 
(LAC A), cishydroxyproline, d,l-3,4-dehydroproline, thiaproline], a, a -dipyridyl, P- 
aminopropionitrile fumarate, 4-propyl-5-(4-pyridinyl)-2(3h)-oxazolone; methotrexate, 
mitoxantrone, heparin, interferons, 2 macroglobulin-serum, chimp-3, chymostatin, beta.- 
cyclodextrin tetradecasulfate, eponemycin; fumagillin, gold sodium thiomalate, d- 

25 penicillamine (CDPT), beta.-l-anticollagenase-serum, D2-antiplasmin, bisantrene, 

lobenzarit disodium, n-(2-carboxyphenyl-4-chloroanthronilic acid disodium or "CCA", 
thalidomide; angostatic steroid, cargboxynaminolmidazole; metalloproteinase inhibitors 
such as BB94. Other anti-angiogenesis agents include antibodies, such as monoclonal 
antibodies against these angiogenic growth factors: bFGF, aFGF, FGF-5, VEGF 

30 isoforms, VEGF-C, HGF/SF and Ang-l/Ang-2. 
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The anti-VEGF antibodies of the present invention, preferably those having 
therapeutic synergistic effects with the anti-VEGF antibodies, may be employed in 
combination with the anti-VEGF antibodies to further enhance the therapeutic effects of 
these two types of drug. Examples of the therapeutic agent include, but are not limited 
5 to, alkylating agents, antibiotic agents, antimetabolic agents, hormonal agents, plant- 
derived agents, and biologic agents. 

Examples of alkylating agents include, but are not limited to, 
bischloroethylamines (nitrogen mustards, e.g. chlorambucil, cyclophosphamide, 
ifosfamide, mechlorethamine, melphalan, uracil mustard), aziridines (e.g. thiotepa), alkyl 
10 alkone sulfonates (e.g. busulfan), nitrosoureas (e.g. carmustine, lomustine, streptozocin), 
nonclassic alkylating agents (altretamine, dacarbazine, and procarbazine), platinum 
compounds (carboplastin and cisplatin). 

Examples of antibiotic agents include, but are not limited to, anthracyclines (e.g. 
doxorubicin, daunorubicin, epirubicin, idarubicin and anthracenedione), mitomycin C, 
15 bleomycin, dactinpmycin, plicatomycin. 

Examples of antimetabolic agents include, but are not limited to, fluorouracil (5- 
FU), floxuridine (5-FUdR), methotrexate, leucovorin, hydroxyurea, thioguanine (6-TG), 
mercaptopurine (6-MP), cytarabine, pentostatin, fludarabine phosphate, cladribine (2- 
CDA), asparaginase, imatinib mesylate (or GLEEVAC®), and gemcitabine. 
20 Examples of such hormonal agents are synthetic estrogens (e.g. diethylstibestrol), 

antiestrogens (e.g. tamoxifen, toremifene, fluoxymesterol and raloxifene), antiandrogens 
(bicalutamide, nilutamide, flutamide), aromatase inhibitors (e.g., aminoglutethimide, 
anastrozole and tetrazole), ketoconazole, goserelin acetate, leuprolide, megestrol acetate 
and mifepristone. 

25 Examples of plant-derived agents include, but are not limited to, vinca alkaloids 

(e.g., vincristine, vinblastine, vindesine, vinzolidine and vinorelbine), podophyllotoxins 
(e.g., etoposide (VP-16) and teniposide (VM-26)), camptothecin compounds (e.g., 20(S) 
camptothecin, topotecan, rubitecan, and irinotecan), taxanes (e.g., paclitaxel and 
docetaxel). 

30 Examples of biologic agents include, but are not limited to, immuno-modulating 

proteins such as cytokines, monoclonal antibodies against tumor antigens, tumor 
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suppressor genes, and cancer vaccines. Examples of interleukins that may be used in 
conjunction with the antibody of the present invention include, but are not limited to, 
interleukin 2 (IL-2), and interleukin 4 (IL-4), interleukin 12 (IL-12). Examples of 
interferons that may be used in conjunction with the antibody of the present invention 
5 include, but are not limited to, interferon a, interferon b (fibroblast interferon) and 
interferon g (fibroblast interferon). Examples of such cytokines include, but are not 
limited to erythropoietin (epoietin a), granulocyte-CSF (filgrastin), and granulocyte, 
macrophage-CSF (sargramostim). Other immuno-modulating agents other than cytokines 
include, but are not limited to bacillus Calmette-Guerin, levamisole, and octreotide. 

10 The anti-VEGF antibodies of the present invention may also be combined with a 

tumor necrosis factor (TNF) or its mutein in the treatment of the above diseases or 
conditions. The administration of anti-VEGF antibody and TNF is repeated until the 
desired clinical effect is achieved. In instances where solid tumors are found in the limbs 
or in other locations susceptible to isolation from the general circulation, the antibody 

1 5 and/or TNF may be administered to the isolated tumor or organ. In other embodiments, a 
FGF or platelet-derived growth factor (PDGF) antagonist, such as an anti-FGF or an anti- 
PDGF neutralizing antibody, is administered to the patient in conjunction with the anti- 
VEGF antibody. Treatment with anti-VEGF antibodies optimally may be suspended 
during periods of wound healing or desirable neovascularization. 

20 Example of monoclonal antibodies against tumor antigens that can be used in 

conjunction with the anti-VEGF antibodies of the present invention include, but are not 
limited to, HERCEPTIN® (Trastruzumab), RITUXAN® (Rituximab), MYLOTARG® 
(gemtuzumab ozogamicin), CAMPATH® (alemtuzumab), ZEVALIN® (ibritumomab 
yiuxetan), PANOREX® (edrecolomab), BEXXAR® (tositumomab), ERBITUX® 

25 (cetuximab), and AVASTIN® (bevacizumab). 

Examples of the tumor suppressor genes include, but are not limited to, DPC-4, 
NF-1, NF-2, RB, p53, WT1, BRCA1 and BRCA2. 

Example of cancer vaccines include, but are not limited to gangliosides (GM2), 
prostate specific antigen (PSA), □ -fetoprotein (AFP), carcinoembryonic antigen (CEA) 

30 (produced by colon cancers and other adenocarcinomas, e.g. breast, lung, gastric, and 
pancreas cancer s), melanoma associated antigens (MART-1, gplOO, MAGE 1,3 
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tyrosinase), papillomavirus E6 and E7 fragments, whole cells or portions/lysates of 
antologous tumor cells and allogeneic tumor cells. 

An adjuvant may be used to augment the immune response to TAAs. Examples 
of adjuvants include, but are not limited to, bacillus Calmette-Guerin (BCG), endotoxin 
5 lipopolysaccharides, keyhole limpet hemocyanin (GKLH), interleukin-2 (IL-2), 
granulocyte-macrophage colony-stimulating factor (GM-CSF) and Cytoxan, a 
chemotherapeutic agent which is believe to reduce tumor-induced suppression when 
given in low doses. 

10 
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EXAMPLES 

1. In Silico design of Anti-VEGF Antibodies for Human ization and Affinity 
Maturation 

5 

The methodology provided in the present invention was used to design libraries 
for humanization of non-human antibodies and to optimize the affinity and other 
attributes of antibodies. Novel variants of amino acids and nucleic acids of antibodies 
were generated with human or human like sequences while their binding affinity, 

10 stability, expression are improved significantly. 

The inventive process was carried out computationally in a high throughput 
manner by mining the ever-expanding databases of protein sequences of all organisms, 
especially human and by relating their specific sequences or their variants with functional 
enhancement such as binding affinity and stability that are tested experimentally. By 

15 using the inventive methodology, an expanded and yet functionally biased library of 
antibodies were constructed based on computational evaluation of extremely diverse 
protein sequences and functionally relevant structures in silico and subsequently tested by 
experimental screening and selection in vitro or in vivo. 

In general, the method was implemented in a computer through in silico selection 

20 of protein sequences based on the amino acid sequence of a target structural/functional 
motif or domain in a lead protein, herein after referred to as the "lead sequence". The 
lead sequence was employed to search databases of protein sequences. The choice of the 
database depends on the specific functional requirement of the designed motifs. For 
example, if the lead protein is an enzyme and the target motif includes the active site of 

25 the enzyme, databases of proteins/peptides of a particular origin, organism, species or 
combinations thereof, may be queried using various search criteria to yield a hit list of 
sequences each of which can substitute the target motif in the lead protein. A similar 
approach may be used for designing other motifs or domains of the lead protein. The 
designed sequences for each individual motif7domain may be combined to generate a 

30 library of designed proteins. In addition, to reduce immunogenicity of the designed 

proteins for human applications such as therapeutics or diagnosis, databases of proteins 
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of human origin or humanized proteins are preferably searched to yield the hit list of 
sequences, especially for motifs derived from sites of the lead protein that serves as the 
scaffolding of the lead proteins such as the frameworks of an antibody. The library of 
designed proteins can be tested experimentally to yield proteins with improved biological 
5 function(s) over the lead protein. 

In this example, the inventive methodology was implemented in designing anti- 
VEGF antibodies that are diverse in sequence and yet functionally related to each other. 
Based on the designed antibody sequences, a library of antibodies were constructed to 
include diverse sequences in the complementary determining regions (CDRs) and/or 

10 humanized frameworks (FRs) of a non-human antibody in a high throughput manner. 
This library of antibodies were screened against human VEGF for improved function 
such as binding affinity and pharmacokinetic property. 

In designing the anti-VEGF antibodies, in silico selection of antibody sequences 
was based on the amino acid sequence of a region or motif in a lead anti-VEGF antibody, 

15 herein after referred to as the "lead sequence". The lead sequence was employed to 
search databases of protein sequences. The choice of the database depends on the 
specific functional requirement of the designed motifs. For example: in order to design 
the framework regions of variable chains for therapeutic application, collections of 
protein sequences that are evolutionarily related such as fully human immunoglobulin 

20 sequences and human germline immunoglobulin sequences would be used except for a 
few structurally critical sites. This would reduce the immunogenic response by 
preserving the origin of the sequences by introducing as few foreign mutants as possible 
in this highly conserved region (for framework regions). On the other hand, diverse 
sequence databases such as immunoglobulin sequences of various species or even 

25 unrelated sequence in genbank can be used to design the CDRs in order to improve 
binding affinity with antigens in this highly variable region. By using the method, a 
library of diverse antibody sequences were constructed and screened experimentally in 
vitro and/or in vivo for antibody mutants with improved or desired function(s). 

30 1) Anti-VEGF antibody libraries designed in silico for affinity maturation 
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The amino acid sequence of the variable heavy chain regions of a murine anti- 
VEGF antibody and the segments for frameworks and CDRs (underlined and annotated) 
is: 

EIOLVOSGPELKOPGETVRISCKAS GYTFTNYGMN (VH/CDRnWVKOAPGKGLK 
5 WMG WINTYTGEPTYAADFKR (VH/CDR2)RFTFSLETSASTAYLOISNLK^ 
YFC AKYPHYYGSSHWYFDV rVH/CDR3)WGAGTTVTVSS (SEQ ID NO:283) 

This V H sequence therein after referred to as the "parental anti-VEGF antibody 55 . 
The frameworks and CDRs are designated according to the Kabat criteria (Kabat EA, 
Redi-Miller M, Perry HM, Gottesman KS (1987) Sequences of Proteins of 
10 Immunological Interest 4 th edit, National Institutes of Health, Bethesda, MD). 

The CDR and framework regions of the antibody were targeted using a modular 
in silico evolutionary design approach as described in more detail in U.S. Patent 
Application Serial Nos: 10/443,134, 10/153,159, 10/153,176, 10/125,687, and 
60/284,407, which are incorporated herein by reference in their entirety. Using murine 
15 anti-VEGF antibody as the lead protein and its V H CDR3 as the lead sequence, digital 
libraries of V H CDR3 were constructed by following the procedure. 

The lead sequence includes V H CDR3 of the parental anti-VEGF antibody and a 
few amino acid residues from the adjacent framework regions 
CAKYPHYYGSSHWYFDVWG. A hit library was constructed by searching and 

20 selecting hit amino acid sequences to V H CDR3 from a sequence database. Variant 
profile was built to list all variants at each position based on the hit library and filtered 
with certain cutoff value to reduce of the size of the resulting hit variant library within 
computational or experimental limit. Variant profiles were also built in order to facilitate 
i) the sampling of the sequence space that covers the preferred region in the fitness 

25 landscape; ii) the partitioning and synthesis of degenerate nucleic acid libraries that target 
the preferred peptide ensemble sequences; iii) the experimental screening of the antibody 
libraries for the desired function; and iv) the analysis of experimental results with 
feedback for further design and optimization. 

The lead structural templates were obtained from the available X-ray structures of 

30 the complexes formed between VEGF and anti-VEGF antibodies. The complex structure 
of VEGF and parental anti-VEGF antibody is designated as 1BJ1, and that formed 
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between VEGF and matured anti-VEGF antibody 1CZ8. The results from 1CZ8 
structural template were similar to those from 1BJ1 in the relative ranking order of the 
scanned sequences. Structural models of anti-VEGF antibodies can be also used. 

The lead sequence for V H CDR3 is taken from the parental anti-VEGF antibody 
5 according to Kabat classification with amino acid residues CAK and WG from the 
adjacent framework regions flanking the V H CDR3 sequence at N- and C- terminus, 
respectively. Only V H CDR3 sequence of the parental antibody was used to build the 
HMM for searching the protein databases. 

The HMM built using the single lead sequence or sequences of the structural 

10 ensembles was calibrated and used to search the Kabat database (Johnson, G and Wu, TT 
(2001) Nucleic Acids Research, 29, 205-206). All sequence hits that are above 
expectation value or E-value are listed and aligned using HAMMER 2.1.1 package. 
After removing the redundant sequences from the hit list, the remaining hit sequences for 
the lead HMM form the hit library. 

1 5 The variant profile at each position was used to build the AA-PVP table (amino 

acid positional variant profile), which gives the number of occurrence of each amino acid 
residue at each position. 

The variant profile can filtered to remove variants that occur at or less than the 
certain cutoff frequency and/or in combination with variant reprofiling using structure- 

20 based scoring. The variant profile from the sequence pool provides informative data to 
identify the positions in the lead sequence that can be either varied or fixed. The sites can 
be divided into three categories: i) Structurally conserved sites remain conserved over 
evolution. The high frequency residues can be used to maintain the scaffold of the target 
motif at these positions; ii) variable functional hot spots should be targeted with focused 

25 mutagenesis; iii) combination of both i) and ii) to stabilize the target scaffold while 
simultaneously providing variability in the functional hot spots. 

A set of the amino acids from the functional variants should be included at the 
functional hot spots according to their frequencies in the variant profile because they are 
evolutionarily selected or optimized. Furthermore, the variants at each position can be 

30 filtered or prioritized to include other potentially beneficial mutants or exclude 

potentially undesirable mutants to meet the computational and experimental constraints. 
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Although the variant profile is informative on the preferred amino acid residues at 
each position and specific mutants in a preferred order, unmodified, it embodies an 
enormous number of recombinants. Some filtering using frequency cutoff can reduce the 
combinatorial sequences that need to be evaluated by computational screening or targeted 
5 directly by experimental libraries. 

A structure-based scoring was applied to screen the hit library and its 
combinatorial sequences that form a hit variant library. Side chains of V H CDR3 of the 
parental anti-VEGF antibody were substituted by rotamers of corresponding amino acid 
variants from the hit variant library at each residue position. The conformations of 
10 rotamers were built and optimized by using the program SCWRL® (version 2. 1) using 
backbone-dependent rotamer library (Bower MJ, Cohen FE, Dunbrack RL (1997) JMB 
267, 1268-82). 

The scoring was done by searching the optimal rotamers and minimizing the 
energy by 100-200 steps using the Amber94 force field in CONGEN [Bruccoleri and 

15 Karplus (1987) Biopolymers 26:137-168] in the presence and absence of the structure of 
the antigen VEGF. The energy scores of an anti-VEGF variant library based on the 
calculated scores with and/without VEGF antigen indicate there are a large number of 
sequences for various variant libraries with higher scores than the parental sequences. 
A refined custom scoring function that includes sidechain entropy, nonpolar 

20 solvation energy and electrostatic solvation energy can be used to gauge the simple 
functions used for scoring. Three energy terms were calculated: sidechain entropy, 
nonpolar solvation energy and electrostatic solvation energy and the backbone entropy 
for loops as well. 

The nonelectrostatic solvation energy is made proportional to the molecular 
25 surface, as calculated by the GEPOL93 algorithm, with the scaling constant of 70 
cal/mol/A 2 (Tunon I, Silla E, Pascual-Ahuir JL (1992) Prot Eng 5, 715-716) using 
GEPOL (Pascual-Ahuir JL, Silla E (1993) J Comput Chem 11, 1047-1060) command as 
implemented in CONGEN. 

The electrostatic solvation energy is calculated using the finite-difference PB 
30 (FDPB) method as implemented in UHBD program (Davis ME, Madura JD, Luty BA, 
McCammon JA (1991) Comput Phys Commun 62, 187-197). The focusing method is 
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used for the region surrounding the mutation. An automated protocol generates three 
grids: coarse, fine, and focus grids. The grid units are 1.5, 0.5, and 0.25 angstroms, 
respectively. The focusing grid is a cubic grid that spans the Cartesian volume occupied 
by the mutated residues. The fine grid is a cubic grid that spans the entire volume of the 
5 protein or the complex. The coarse grid is a cubic grid that is set to approximately twice 
the size of the fine grid in each axis and covers approximately 8 times the volume of the 
fine grid. The coarse grid serves to account for the long-range solvent effects and sets the 
boundary conditions for the fine grid. Similarly, the fine grid accounts for the 
electrostatic contributions of the protein interior and sets the boundary condition for the 

10 focus grid. The focus grid accounts for finer details of the localized effects due to the 
mutation. The dielectric constants for the protein interior and exterior are set to 4 and 78, 
respectively. Temperature is set to 300 Kelvin and ionic strength is set to 150 mM. 
Maximum iteration is set to 200. The calculations are repeated with a uniform dielectric 
so that both the interior and exterior dielectrics are set to 4 and the difference between the 

15 two energies is computed. The latter calculations represent the energies due to bringing 
the charges onto the grids. 

The custom scoring function or the molecular mechanics energy using Amber94 
forcefield in CONGEN plus the solvation terms from PB in UHBD used here is similar to 
MM-PBSA or MM-GBSA. The energy function shows better agreement with 

20 experimental data (Sharp KA. (1998) Proteins 33, 39-48; Novotny J, Bruccoleri RE, 
Davis M, Sharp KA (1997) J Mol Biol 268, 401-41 1), especially when structure 
ensembles by molecular dynamics calculations are used to provide more accurate 
methods to score sequence and its variants based on the ensemble averages of the energy 
functions (Kollman PA, Massova I, Reyes C, Kuhn B, Huo SH, Chong LT, Lee M, Lee 

25 TS, Duan Y, Wang W, Donini O, Cieplak P, Srinivasan P, Case DA, and Cheatham TE 
(2000) Acc. Chem Res. 33, 889-897). 

The variant profile from the hit variant library as described above was filtered in 
order to reduce the potential library size while maintaining most of the preferred residues. 
A strategy that selects top sequences based on favorable score and/or the presence of 

30 residues likely to participate in favorable interactions was employed to identify a cluster 
or clusters of amino acid sequences for the nucleic acid library design. As described 
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above, a cluster of computationally screened sequences for V H CDR3, CDR2 and CDR1, 
respectively, was chosen for further experimental test in vitro. 

The hit variant library constructed above was targeted with a single degenerate 
nucleic acid library. The nucleic acid sequence profile was built using the optimal 
5 codons for specific organism that are combined with minimal sizes. A degenerate nucleic 
acid library was synthesized by incorporating a mixture of bases into each degenerate 
position. As a result of the combinatorial effect of the synthesis, this degenerate nucleic 
acid library encodes an expanded amino acid library. 

Variants at each position for VH and VL are shown in the VH and VL amino acid 

10 position variant profile, respectively. Detailed results for Vh and/or Vl CDR3, CDR2, 
and CDR1, FR123 were summarized in anti-VEGF VH and VL Tables. These 
experiments demonstrated that by using the methods of the present invention, antibodies 
could be selected, not only with diverse sequences and phylogenic distances, but also 
with improved binding affinity to bind to. the target VEGF antigen. 

15 Figure 1A is an amino acid positional variant profile (AA-PVP) for Vl of anti- 

VEGF antibodies designed using the methodology described above. The AA-PVP of the 
V L chain uses the humanized V L (SEQ ID NO:l) by Baca et al. (1997) J Biol Chem 272: 
10678-10684 as a reference sequence. The dot indicates the same amino acid as 
reference. CDRs are indicated in bold letters underlined. 

20 Figure IB is an amino acid positional variant profile (AA-PVP) for V H of anti- 

VEGF antibodies designed using the methodology described above. The AA-PVP of the 
V H chain uses the humanized V H (SEQ ID NO:55) by Baca et al. (1997), supra as a 
reference sequence. The dot indicates the same amino acid as reference. CDRs are 
indicated in bold letters underlined. 

25 The antibody libraries that were designed in silico, based on a lead sequence of 

the parental anti-VEGF antibody by using the methods described above were tested for 
their ability to bind to the antigen, VEGF, by using a novel phage display system. 

2) Generation of anti-VEGF antibody libraries for framework optimization 

30 
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A murine monoclonal antibody was found to block VEGF-dependent cell 
proliferation and slow the tumor growth in vivo (Kim KJ, Li B, Winer J, Armanini M, 
Gillett N, Phillips HS, Ferrara N (1993) Nature 362, 841-844). This murine antibody was 
humanized (Presta LG, Chen H, O'Connor SJ, Chisholm V, Meng YG, Krummen L, 
5 Winkler M, Ferrara N (1997) Cancer Res. 57, 4593-4599; Baca M, Presta LG, O'Connor 
SJ, Wells JA (1997) J Biol Chem 272, 10678-10684) using random mutagenesis at some 
key framework positions following grafting of antigen-binding loops. Typically, after 
rounds of site-directed mutagenesis and selection, humanized antibodies are generated by 
replacing a human or concensus human framework with non-human amino acids from the 

10 parental non-human antibody at certain pre-determined key positions. These humanized 
antibodies will usually bind to its cognate antigen of its parental antibody with the 
reduced affinity relative its parental antibody (about 6-fold weaker for humanized anti- 
VEGF relative its parental murine antibody, see Baca M, Presta LG, O'Connor SJ, Wells 
JA (1997) J Biol Chem 272, 10678-10684, and 2-fold weaker for another version of the 

15 humanized anti-VEGF, see Presta LG, Chen H, O'Connor SJ, Chisholm V, Meng YG, 
Krummen L, Winkler M, Ferrara N (1997) Cancer Res. 57, 4593-4599; Baca M, Presta 
LG, O'Connor SJ, Wells JA (1997) J Biol Chem 272, 10678-10684). This loss of 
binding affinity would be recovered by using affinity maturation in CDRs (Chen Y, 
Wiesmann C, Fuh G, Li B, Christinger HW, McKay P, de Vos AM, Lowman HB (1999) 

20 J. Mol Biol 293, 865-881). 

Using present inventive methods described, we have discovered several 
humanized frameworks that are several-folder higher in binding affinity (3. 3 -fold for 
hAB2, 4-fold for hAB3 and 2-fold for hAB4) upon framework optimization than the 
reported humanized anti-VEGF antibody sequence (see hABl in Figure 2 for the 

25 humanized anti-VEGF antibody framework reported in the literature (Presta LG, Chen H, 
O'Connor SJ, Chisholm V, Meng YG, Krummen L, Winkler M, Ferrara N (1997) Cancer 
Res. 57, 4593-4599). Because the reported humanized anti-VEGF antibody is - 2 times 
weaker than its corresponding murine antibody, our humanized antibodies (hAB2 and 
hAB3) should have ~2-fold higher binding affinity upon humanization than the 

30 corresponding murine antibody. Also, we can achieve such improvement by either 
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increasing the on-rate (Kon) or decreasing the off-rate (K^) or both of the humanized 
antibodies relative to the reported humanized antibody (hABl) (see Figure 2). 



The amino acid sequence of the framework frl23 region of the murine anti-VEGF 
5 antibody is: 

EIQLVQSGPELKQPGETVRISCKASWVKQAPGKGLKWMGRFTFSLETSASTAYL 
QISNLKNDDTATYFCA. 

This sequence is therein after referred to as frl23 of "murine anti-VEGF 
antibody", see Presta LG, Chen H, O'Connor SJ, Chisholm V, Meng YG, Krummen L, 

1 0 Winkler M, Ferrara N (1 997) Cancer Res. 57, 4593-4599). The relatively constant 
framework 4 can be designed if desired using the same approach. The framework and 
CDRs are designated according to the Kabat criteria (Kabat EA, Redi-Miller M, Perry 
HM, Gottesman KS (1987) Sequences of Proteins of Immunological Interest 4 th edit, 
National Institutes of Health, Bethesda, MD), although other classification can be used 

15 also. Also, separate segment of framework FR1 or FR2 or FR3 and FR4 can be designed 
individually and pasted together if desired. The combination of CDRs and FRs can be 
designed simultaneously by designing each segment or combinations of segments used 
the approach described here. The CDRs are the same as in SEQ283 from the murine 
anti-VEGF or those designed using the approach described here. However, different 

20 CDRs can be also designed and used in combination with the designed FR123 libraries. 
The variant profiles for the hit library are generated using the human V H germline 
sequences based on the lead sequence of V H FR123 of the murine anti-VEGF antibody 
and are filtered by using certain cutoff values. The variant at each position can be ranked 
based on its structural compatibility with the antibody structure using total energy or 

25 other scoring terms. Some reference amino acids are found to be favorable at certain 
positions based on their total energy or specific packing, although their occurrence 
frequency is very low. The variant profiles for the hit library can be generated and 
refined using the Kabat-derived human VH sequences based on the lead sequence of V H 
FR123 of the murine anti-VEGF antibody. The filtered variant profile can be further 

30 screened computationally to reflect the ranking order of the structural compatibility if 
only the antibody structure is used. Although the human vs non-human sequences differ 
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in many positions across the entire chain for Vh, the amino acid libraries used in other 
humanization approach are focused on randomization at a few positions flanking grafted 
CDRs, whereas in a preferred embodiment, the humanization library targets various 
positions across both VH and VL chains with a few mutants at those positions for the 
5 starting anti-VEGF antibody. 

In a preferred embodiment, each motif such as frameworks FR1, FR2, FR3 and 
FR4 or its combination such as FR123 of the antibody can be targeted using a modular in 
silico evolutionary design approach. It has been understood that there are only a limited 
number of conformations (called canonical structures) for each motif or its combination. 

10 These structural features of an antibody provide an excellent system for testing the 

evolutionary sequence design by using structured motifs at various regions of an antibody 
based on the extensive analysis of antibody structures. These structure and sequence 
conservation are observed across different species. In fact, the scaffolding of antibodies, 
or the immunoglobin fold, is one of the most abundant structure observed in nature and is 

1 5 highly conserved among various antibodies and related molecules. 

In a preferred embodiment, the method can be also used to design antibody 
framework using sequence-based approach or structure ensembles that contain the 
induced structure changes in CDRs. Using murine anti-VEGF antibody framework as the 
lead protein and its V H FR123 as the lead sequence, digital libraries of V H FR123 were 

20 constructed. 

In a preferred embodiment, a hit library was constructed by searching and 
selecting hit amino acid sequences using V H FR123 as the lead sequence. Variant profile 
was built to list all variants at each position based on the hit library and filtered with 
certain cutoff value to reduce of the size of the resulting hit variant library within 

25 computational or experimental limit. Variant profiles were also built in order to facilitate 
i) the sampling of the sequence space that covers the preferred region in the fitness 
landscape; ii) the partitioning and synthesis of degenerate nucleic acid libraries that target 
the preferred peptide ensemble sequences; iii) the experimental screening of the antibody 
libraries for the desired function; and iv) the analysis of experimental results with 

30 feedback for further design and optimization. 
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The lead structural templates were obtained from the available X-ray structures of 
the complexes formed between VEGF and anti-VEGF antibodies. The complex structure 
of VEGF and parental anti-VEGF antibody is designated as 1BJ1, and that formed 
between VEGF and matured anti-VEGF antibody 1CZ8. The results from 1CZ8 
5 structural template were similar to those from 1BJ1 in the relative ranking order of the 
scanned sequences. The modeled structure or structure ensemble or ensemble average 
can be also used for screening sequences. The lead sequence for Vh FR123 (SEQ283) is 
taken from the murine anti-VEGF antibody according to Kabat classification. The HMM 
built using the single lead sequence was calibrated and used to search human heavy chain 

10 germline sequence database and/or human sequence database (including human 

germlines and humanized sequences) derived from Kabat database (Johnson, G and Wu, 
TT (2001) Nucleic Acids Research, 29, 205-206). All sequence hits that are above 
expectation value or E-value are listed and aligned using HAMMER 2.1.1 package. 
After removing the redundant sequences from the hit list, the remaining hit sequences for 

15 the lead HMM form the hit library. The sequence identities of the hit sequences from the 
human Vh germline ranges from 40 to 68% of the lead sequence, whereas the 
corresponding sequence identities of the hit sequences from human immunoglobin 
sequences derived from Kabat database (the database are parsed to frl23 fragment in 
order to increase the sensitivity of the search and their relative ranking) (other database 

20 would be used if the contain the immunoglobin sequences of human origins) ranging 

from -30 to 75%. The evolutionary distances between the hits can be analysed by using 
the program TreeViewl.6.5 ( http://taxonomv.zoology.gla.ac.uk/rod/rod.htmn . 

The AA-PVP tables give the number of occurrence of each amino acid residue at 
each position. There are some differences between the hit sequences from the human VH 

25 germlines and those from Kabat-derived human V H sequences: amino acids of non- 
human origins resulting from amino acids that are structurally important to stabilize the 
scaffold of the target antibodies etc. This filtered variant profile can be further screened 
computationally to reflect the ranking order of the structural compatibility if only the 
antibody structure is used. In short, using different database of human origin for 

30 framework optimization would provide diverse but powerful choices of amino acids for 
framework optimization including humanization with improved binding affinity and 
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stability. With the increase in our knowledge in developing therapeutic antibodies, more 
and more antibody sequence data will be accumulated and guide our design using present 
invention. No prior assumption is needed to assume the key positions and amino acids 
associated with those positions. Because this information is revealed automatically using 
5 present inventive method, it will become better defined with increase in their occurrence 
in database as more data are accumulated. Variants can be re-profiled or prioritized to 
include other potentially beneficial mutants using structure-based criteria. The structure- 
based energy scoring provides another way to re-profile the occurrence of variants at 
each position for the hit variant library which was originally built based on profiling of 

10 sequences selected from protein databases. Some filtering using frequency cutoff can 
reduce the combinatorial sequences that need to be evaluated by computational screening 
or targeted directly by experimental libraries. Even with the cutoff applied to the variant 
profile, there is still a large number of combinatorial sequences that needs to be scored 
and evaluated in the final sequences for experimental screening. 

15 A structure-based scoring is applied to screen the hit library and its combinatorial 

sequences that form a hit variant library. Side chains of V H FR123 of the anti-VEGF 
antibody in 1CZ8 or 1BJ1 were substituted by ro tamers of corresponding amino acid 
variants from the hit variant library at each residue position by using the program 
SCWRL® (version 2.1) using backbone-dependent rotamer library (Bower MJ, Cohen 

20 FE, Dunbrack RL (1997) JMB 267, 1268-82). The scoring was done using energetic 
terms or their combination with a scaling factor for each term after the structure is 
optimized using the optimal rotamers and energy minimization using the Amber94 force 
field in CONGEN [Bruccoleri and Karplus (1987) Biopolymers 26:137-168] in the 
presence and absence of the structure of the antigen VEGF. 

25 The designed framework VH frl23 have good structural compatibility with the 

structure relative to the murine reference. The human-like features of the framework 
optimization as defined partly by its database were also gauged using the phylogenetic 
distance analysis of the designed sequences with those from human germlines or 
humanized frameworks. 

30 The variant profile from the hit variant library as described above was filtered in 

order to reduce the potential library size while maintaining most of the preferred residues 
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obtained from a hit variant library after eliminating amino acids with occurrences lower 
than the cutoff value and/or by screening sequences based on their compatibility with the 
structural scaffolding. 

The hit variant library constructed above was targeted with a degenerate 
5 oligonucleotides. The degenerate nucleic acid library constructed above was cloned into 
a phage display system and the phage-displayed antibodies (ccFv) were selected based on 
their binding to immobilized VEGF coated onto 96-well plates. The library was installed 
into a phage display vector pABMD12 in which the V H of anti-VEGF was replaced by 
the library. As a result, V L and a variety of V H generated from the library would pair to 

10 form a functional ccFv of anti-VEGF. The phage display library was then used for 
further panning against immobilized VEGF protein antigen. 

In order to generate a library that can cover such a wide range of scattered 
distribution of degenerative positions, multiple overlapping degenerative DNA 
oligonucleotides were synthesized with degenerative positions at the sites where the 

15 library was designed. The assembly process consisted of two PCR reactions, assembly 
PCR, and amplification PCR. The assembly oligos were designed with 35-40mers and 
overlapped by 1 5-20 bases with melting temperature of about 60°C by average. One 
additional pair of amplification oligo primers (Amp93 and Amp94) was created for final 
amplification of the designed products. Accordingly, the assembly PCR includes: equal 

20 amount of the assembly oligo primers in a final total concentration of 8 uM, dNTP of 0.8 
uM, Ix pfu buffer (Strategene), and 2.5 units of pfu turbo (Strategene). The thermal 
cycle was performed as follows: 94°C x 45", 58°C x 45", 72°C x 45" for 30 cycles and a 
final extension of 10 minutes at 72°C. The PCR product mix was diluted 10 folds and 
used as the template for the amplification PCR in which all reagents were remained the 

25 same except for addition of the amplification primers at the final concentration of 1 uM. 
The thermal cycle was performed as follows: 94°C x 45", 60°C x 45", 72°C x 45" for 30 
cycles and a final extension of 20 minutes at 72°C. The final product (the VH library) 
was purified, digested with Hindlll and Styl, and finally subcloned into vector 
pABMD12 to replace the original murine V H . The library was used to electrically 

30 transform TGI competent cells, which were in turn amplified and rescued by helper 
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phage K07 (Amersham) before production of phages of the library at 30°C overnight 
according to standard procedure. 

To screen the library constructed described in the above example, purified 
homodimeric VEGF protein (Calbiochem) was diluted in designated concentration in 
5 coating buffer (0.05 M NaHCC>3, pH 9.6) and immobilized on Maxisorb wells (Nunc) at 
4°C overnight. The coated wells were then blocked in 5% milk at 37°C for 1 hr before 
phage library diluted in PBS was applied in the wells for incubation at 37°C for 2 hrs. 
The incubation mix also routinely contained 2% milk to minimize nonspecific binding. 
At the end of the incubation, the wells were washed and the phages bound were 

10 subsequently eluted by 1.4% triethyl amine before infecting TGI cells followed by rescue 
by K07 helper phage for amplification. To amplify the phages, infected and rescued 
TGI cells were then grown at 30°C overnight in the presence of carbenicilline and 
kanamycin before phage library was harvested. The phages amplified were used as the 
input library for the next round of panning. Meanwhile, individual clones from 5 th 

1 5 panning and on were randomly sampled for phage ELIS A, in which specific binding to 
immobilized VEGF would be confirmed, and demonstrated 100% positives from the 5 th 
to 7 th pannings. Finally, isolated clones grown on plates of 2xYT/carbenicilline (100 
ug/ml)/kanamycin (70 ug/ml) were sampled for sequencing beginning from the 5 th 
panning (P5) to define the hit positions and hit sequences against the design. 

20 Figure 1C shows amino acid sequences of full length V L (as compared with that 

of the humanized V L (SEQ ID NO:l) as in Baca et al. (1997) J Biol Chem 272:10678- 
10684, and mouse anti-VEGF monoclonal antibody (SEQ ID NO:284) as in Kim et al. 
(1993) Nature 362:841-844, Vi/CDR, and Vl/FR of selected anti-VEGF antibodies. 

Figure ID shows amino acid sequences of full length Vh (as compared with that 

25 of the humanized V H (SEQ ID NO:55) as in Baca et al. (1997) J Biol Chem 272: 10678- 
10684, that of affinity-matured V H (SEQ ID NO:56) as in Chen et al. (1999) J. Mol. Biol. 
293:865-881), and that of mouse anti-VEGF monoclonal antibody (SEQ ID NO:283) as 
in as in Kim et al. (1993) Nature 362:841-844, Vl/CDR), V h /CDR, and V H /FR of 
selected anti-VEGF antibodies. 

30 For the purpose of direct comparison with anti-VEGF antibodies generated by 

others, the humanized V L (SEQ ID NO: 1 ) disclosed in the present invention is the same 
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as the V L of antibody Y0317 described in Chen et al. (1999) J. Mol. Biol. 293:865-881; 
and V H (SEQ ID NO:55) disclosed in the present invention is the same as the V L of 
antibody Y0192 described in Chen et al. (1999), supra. The humanized and affinity 
matured V H (SEQ ID NO: 56) disclosed in the present invention is the same as the Vh of 
5 antibody Y0317 described in Chen et al. (1999), supra. 

The selected optimized Vh frameworks also cluster together with the humanized 
V H sequence, very close in phylogenetic distance to the human germline V H 3 family, 
while the murine V H framework is very distant from the optimized V H frameworks and 
human germlines. The phylogenic analysis of the hit sequences against the entire human 

10 immunoglubin repertoire of Vh suggests that they are indeed most closely related to 
human germline family III. 

This supports the conclusion that the present inventive method in designing 
optimized frameworks with fully human or human-like sequences of the optimized 
antibodies, depending on the fine balance between human-like and compatibility with 

1 5 structure template or templates from ensemble structure or structure average. 

Using our inventive methods described, we have discovered numerous heavy 
chain (e.g., SEQ ID NO: 70, SEQ ID NO: 67 and SEQ ID NO: 75) humanized 
frameworks with higher binding affinity upon framework optimization than the parental 
or reference anti-VEGF antibody sequence (SEQ ID NO: 55). This improvement comes 

20 mainly from a larger increase in the on-rate (Ko n ) and small decrease in the off-rate (Ko ff ) 
by framework humanization alone. Figure 2A shows the affinity data of 5 antibodies, 
parental antibody (hABl) and the optimized frameworks (hAB2, hAB3, hAB5) of anti- 
VEGF antibody selected from designer libraries using BIAcore biosensor. The 
measurement was performed by measuring the change of SPR units (y-axis) vs time (x- 

25 axis) when a purified antibody binds its antigen (VEGF) immobilized on the CM5 

biochip at 25°C. Two humanized frameworks hAB2 and hAB3 are ~4-folder higher in 
binding affinity (in single chain format) upon framework optimization than the 
parental/reference anti-VEGF antibody sequence reported in the literature (Presta LG, 
Chen H, O'Connor SJ, Chisholm V, Meng YG, Krummen L, Winkler M, Ferrara N 

30 (1997) Cancer Res. 57, 4593-4599), these two humanized antibodies should have ~2-fold 
higher binding affinity upon humanization than the corresponding murine antibody. 
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2. Selection of Candidate Antibodies 

1) ccFv — a heterodimeric coiled-coil stabilized antibody 

5 The present invention provides a new strategy to stabilize V H and V L heterodimer. 

A unique heterodimerization sequence pair was designed and used to create a Fab-like, 
functional artificial Fv fragment ccFv (US200030027247A1). This sequence pair 
specifically forms a coiled-coil structure and mediates the functional heterodimerization 
of GABAb-R1 and GABAb-R2 receptors. Each of the heterodimeric sequence pair was 

10 derived from the coiled coil domains of heterodimeric receptors GABA B Rl and R2, 

respectively. For the purpose of engineering a heterodimer of Vh and Vl of an antibody, 
the pair of sequences GR1 and GR2, are fused to the carboxy-terminus of V H and V L 
fragment, respectively. Thus, the functional pairing of V H and V L , ccFv (coiled coil Fv), 
is mediated by specific heterodimerization of GR1 and GR2. Recombinant ccFv 

1 5 antibody fragments were expressed with a molecular weight 35 kDa. 

Vh and V L sequences of an anti-VEGF antibody was cloned into a vector that 
expressed two fusion proteins: V H -GR1 and V L -GR2-pIII fusions. The expressed V H - 
GR1 and V L -GR2-pIII fusions are secreted into periplasmic space, where they 
heterodimerize to form a stable ccFv antibody via the coiled-coil domain. To display 

20 ccFv antibodies on the surface of filamentous bacteriophage, the vector above was 

transformed into bacterial TGI cells, which were further superinfected with K07 helper 
phage. 

2) Adaptor-mediated phage display system 

25 In the conventional phage display system, a protein of interest is fused to a phage 

capsid protein such as pill in order to be displayed on the surface of phage. This fusion 
protein will be assembled into phage particles with the wild-type phage proteins provided 
by a helper phage such as K07. A new phage display system named adaptor-directed 
display system as described in US20030104355A1 was used for displaying various 

30 antibody fragments on the surface of phage. In general, an antibody fragment such as 
single chain variable fragment is carried to the surface of the phage particle by a pair of 
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adaptors that specifically form a heterodimer, one being fused with the displayed protein 
in an expression vector and the other being fused with a phage capsid protein in a helper 
vector. Particularly, each of the heterodimeric sequence pair, GR1 and GR2, was derived 
from the coiled coil domains of heterodimeric receptors GABAb Rl and R2, 
5 respectively. For the purpose of displaying an antibody fragment, the sequences GR1 
was fused to the carboxy-terminus of an antibody fragment in an prokaryotic expression 
vector, whereas GR2 was fused to the amino terminus of the capsid protein III of 
bacterophage genome. The heterodimer is formed via a sequence pair specifically forms 
a coiled-coil structure and mediates the functional display of the antibody fragment upon 
10 rescue of the E.coli carrying the expression vector by the helper phage. 

3) Preparation of candidate antibody libraries 

DNA of libraries of candidate antibodies was prepared based on PCR assembly 
using standard PCR procedure. The DNA was then restriction digested, purified, and 
15 ligated into an appropriate vector as described above. After ligation, DNA was 

transformed into TGI cells. Phages were prepared from TGI cells by a helper phage 
infection. The infected TGI cells were grown in 2xYT/ Amp/Kan at 30°C overnight. 
The phagemid particles were precipitated by PEG/NaCl from culture supernatants, and 
resuspended in PBS. 

20 

4) Selection of candidate antibody libraries 

The phage libraries of the candidate antibodies were used for library selection 
against immobilized VEGF. Purified recombinant human VEGF165 was purchased from 
Calbiochem (cat. No: 80054-994). The glycosylated, disulfide-linked homodimer of a 

25 165 amino acid residue variant of human VEGF has an apparent molecular weight of 42 
kD (Burke et al. Biochem. Biophys. Res. Commun. 207:348 (1995); Neufeld et al. Prog. 
Growth Factor Res. 5: 89 (1994); Leung et al. Science 246:1306 (1989)). In general, 
after incubation period of binding, unbound phages were washed away and bound phages 
were eluted and amplified for the next round of panning. Similar procedures used in 

30 general for phage display have been illustrated in the past (Barbas et al., Phage display: a 
laboratory manual, Cold Spring Harbor Laboratory Press, 2001). 
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For example, an aliquot of 100 ul of 2ug/ml purified human recombinant 
VEGF165 was first immobilized onto each well of a 96-well plate. After blocking with 
5% milk in PBS, an aliquot of the library phages in 2% milk/PBS was added into the well 
and incubated. The phage containing solution was then discarded, and the wells were 
5 washed. Bound phages were finally eluted with 100 mM triethylamine, and were added 
to TGI culture for infection. The phages prepared from infected TGI cells were 
consequently used for the next round of panning. Positive clones were then confirmed by 
ELIS A against VEGF antigen protein, in which the phages bound to the immobilized 
antigen were detected by incubation with HRP-conjugated anti-M13 antibody against 

10 phage coat protein pVIIL The substrate ABTS [2,2'Azino-bis(3-ethylbenzthiazoline-6- 
sulfonic acid)] was used for measurement of HRP activity. DNAs of those clones that 
were positively confirmed were sequenced. DNA sequences sampled were then 
translated to amino acid sequences. Selected sequences were combined to generate new 
variants. Selected variants were expressed to produce soluble antibody fragments for 

15 affinity evaluation. 

3. Amplification of V k and V| from Human PBLs 

The total RNA was extracted from human peripheral blood lymphocytes (PBLs), 
and was used as a template for first-strand cDNA synthesis by using olig-dT primer and 
reverse transcriptase in a standard procedure (first strain cDNA synthesis kit was 
purchased Roche Applied Science). The antibody light-chain variable genes were 
amplified from the single stranded cDNA by PCR. The PCR primers were designed and 
synthesized based on Kaba V-gene database (table 1). In order to amplify all 6 subfamily 
of Vk genes, 6 degenerate primers targeted the 5' end of the 40 germline human Vk 
genes and 3 degenerate primers targeted the 3' end of the 5 human Jk segment genes 
were used. For the PCR amplification of all 10 subfamily of VI genes, 8 degenerate 
primers targeted the 5' end of the 31 germline human VI genes and 2 degenerate primers 
target to the 3' end of the 4 Jl segment genes were used. The PCR amplified Vk and VI 
genes were further cloned into a phagemid vector carried a humanized anti-VEGF 
antibody VH to generate phage display library. 



20 



25 
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Ta 


3le 1 : Primers for Vk and VI amplification 


Vk 


ATTAATGGATCCGMCATCCRGWTGACCCAGTCTCC 


ATTAATGGATCCGATRTTGTGATGACYCAGWCTCC 


ATTAATGGATCCGAAATWGTGWTGACRCAGTCTCC 


ATTAATGGATCCGACATCGTGATGACCCAGTCTCC 


ATTAATGGATCCGAAACGACACTCACGCAGTCTCC 


ATTAATGGATCCGAAATTGTGCTGACTCAGTCTCC 


VI 


ATTAATGGATCCCAGTCTGTGYTGACKCAGCC 


ATTAATGGATCCCAGTCTGCCCTGACTCAGCC 


ATTAATGGATCC.TCCTATGAGCTGACWCAGCyAC 


ATTAATGGATCCTCTTCTGAGCTGACTCAGGAC 


ATTAATGGATCCCTGCCTGTGCTGACTCAGCC 


ATTAATGGATCCCAGCYTGTGCTGACTCAATC 


ATT AATGG AT C C C AG S C TGTG C TG AC T C AG C C 


ATTAATGGATCCAATTTTATGCTGACTCAGCCC 


ATTAATGGATCCCAGRCTGTGGTGACYCAGGAG 


ATTAATGGATCCCAGGCAGGGCTGACTCAGCC 


Jk 


TTAATTGCGGCCGCTTTGATYTCCASCTTGGTCCC 


TTAATTGCGGCCGCTTTGATATCCACTTTGGTCCC 


TTAATTGCGGCCGCTTTAATCTCCAGTCGTGTCCC 


Jl 


TTAATTGCGGCCGCTAGGACGGTS AS CTTGG 


TTAATTGCGGCCGCGAGGACGGTCAGCTGGG 



5 

4. Expression of Soluble Antibody Fragments 

Soluble antibody fragments in the format of single chain variable fragment (scFv) 
can be generated in prokaryotic (E. coli) and eukaryotic (yeast) expression systems for 

10 the purpose of biophysical analysis. Construction of scFv includes a VH fragment and a 
VL fragment connected by a linker of (648)3 as described in previous studies ((Barbas et 
al., Phage display: a laboratory manual, Cold Spring Harbor Laboratory Press, 2001). 
Expression vector used for prokaryotic expression is illustrated in Figure 7. Competent 
bacterial cells, e.g., BL21, were prepared and transformed with a vector that carries the 

15 antibody fragment according to methods known in the art (Sambrook, Fritsch and 

Maniatis, Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Press, 
Cold Spring Harbor, N.Y., (1989); Antibodies: A Laboratory Manual, Cold Spring 
Harbor Laboratory, 1988). The transformed cells are cultured under conditions suitable 
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for protein expression. Such conditions are well known to artisans in the field and hence 
are not detailed herein. The expressed antibodies are harvested using conventional 
methods known in the art and used for further analysis. Expression in yeast was 
performed using Pichia expression kit purchased from Invitrogen and according to 
5 manufacturer's instruction. All antibodies were tagged with a HA-His6 tag at C- 
terminus, and purified by NTA and Superdex 75 columns. In order to determine the 
purity and expression yield of antibody fragments, 20 ul of purified proteins are analyzed 
by SDS-PAGE gel, and visualized by staining with Coomassie Brilliant Blue R-250. 

10 5. Affinity Analysis of Selected Antibody Fragments 

BIAcore Biosensor Assay was used to determine antibody affinity. VEGF protein 
(purchased from Calbiochem) was coupled to a CM-5 biosensor ship by amine coupling. 
After immobilization, CM-5 chips with 200 to 1000 response unites of VEGF were kept 

15 at 4°C before use. All experiments were performed at 25°C or 35°C. Each sample in PBS 
buffer was injected over VEGF surface at a flow rate of 20 ul/min using Biacore 1000 
(Biacore AB, Sweden), and bound antibodies were removed from chip by 10 ul of 
glycine-HCl, PH 1.5 at the end of each cycle. Each sensorgram was recorded and 
normalized to a PBS base line. To determine the antibody affinity including association 

20 and dissociation rate constants, the sensorgrams were analyzed by binding curve fitting to 
1:1 Langmuir binding model using BIAevalvation version 3 software. 

6. Evaluation of Stability of Antibody Fragments 

25 In order to evaluate stability of the defined antibody fragments, soluble antibodies 

in PBS were incubated in 4°C, 37°C, and 42°C, respectively. The antibody concentration 
was 0.5uM or luM. After indicated times of incubation, antibody solutions were set at 
25°C for 1 hr, then directly injected over VEGF surface in Biacore 1000. The antibody 
binding RU was recorded, and used for stability analysis. 

30 
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